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I.  Introduction 


Polygraph  examinations  are  the  most  widely  used  method  to  distinguish  between  truth  and 
deception.  In  a  Polygraph  examination  a  person  is  connected  to  a  special  instrument 
called  a  Polygraph  which  records  several  physiological  signals  such  as  blood  pressure, 
Galvanic  Skin  Response,  and  respiration.  The  subject  is  asked  a  set  of  questions  by  an 
examiner.  By  looking  at  these  signals  the  examiner  is  able  to  determine  the  reactions  of 
the  subject  to  the  questions  and  decide  whether  the  person  was  truthful  or  deceptive  in 
answering  each  question.  The  problem  with  human  classification  of  Polygraph  tests  is  that 
the  outcome  depends  on  the  examiner's  experience  and  personal  opinion.  Automatic 
scoring  of  Polygraph  tests  has  been  a  subject  of  extensive  research.  Several  methods  for 
Polygraph  classification  have  been  studied  which  are  mostly  based  on  statistical 
classification  techniques. 

In  this  study  two  main  goals  were  presented.  The  first  goal  was  finding  appropriate 
features  which  have  physiological  basis.  The  second  purpose  was  trying  a  new 
classification  method  based  on  fuzzy  set  theory.  The  advantage  of  using  fuzzy  logic  is  that 
the  it  does  not  simply  assigns  each  input  to  one  of  the  classes,  but  it  gives  the  possibility  of 
belonging  of  an  input  to  each  class. 

Digitized  Polygraph  data  used  in  this  project  were  collected  from  vanous  police  stations. 
The  data  files  were  organized  according  to  the  test  format  used  and  were  decoded  to 
ASCII  format  so  they  can  be  read  by  Matlab.  Preprocessing  and  feature  extraction 
routines  were  implemented  in  the  Matlab  language.  Three  sets  of  files  were  chosen,  each 
one  of  them  contained  50  deceptive  and  50  non-deceptive  files.  These  files  are  listed  in 
Table  10  in  Appendix  A.  A  set  of  features  were  selected  based  on  physiological  reactions, 
and  the  feature  vectors  for  every  file  in  each  set  were  found.  Different  classification 
methods  were  studied  and  a  Fuzzy  K-nearest  neighbor  classifier  was  selected. 

Significance  of  each  feature  was  examined  according  to  the  clustering  and  correct 
classification  obtained  by  using  that  individual  feature.  Thirty  features  were  selected  as 
the  final  set  of  features  and  a  subset  of  combinations  of  2  to  4  of  these  features  were 
examined  to  study  the  effects  of  combining  the  features  on  classification  results.  The 
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combination  that  produced  the  best  classification  for  all  three  sets  on  the  average  was 
selected  and  the  effects  of  changing  the  classifier  parameters  on  classification  was  studied. 
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II.  Polygraphs* 

A  polygraph  examination  is  the  most  popular  method  used  to  determine  if  an  individual  is 
being  truthful  or  deceptive.  During  an  examination,  a  subject  is  asked  a  series  of  questions 
and  the  physiological  responses  to  the  questions  are  recorded  using  a  polygraph.  The 
three  physical  responses  currently  obtained  from  a  polygraph  examinations  are  blood 
pressure,  respiration,  and  skin  conductivity.  Polygraph  charts  are  usually  analyzed  by  a 
human  interpreter  for  evidence  of  truth  or  deception;  however,  computer  algorithms  are 

now  being  used  to  verify  results  [1][2]. 


IL1.  History 

The  first  attempt  to  use  a  scientific  instrument  in  an  effort  to  detect  deception  occurred 
around  1895  [3],  That  was  the  year  that  Caesar  Lombroso  published  the  results  of  his 
experiments  in  which  a  hydrosphygmograph  was  used  to  measure  the  blood  pressure-pulse 
changes  of  criminals  in  order  to  determine  whether  or  not  they  were  deceptive.  Although 
the  hydrosphygmograph  was  originally  intended  to  be  used  for  medical  purposes, 

Lombroso  found  that  it  worked  well  for  lie  detection.  Lombroso  may  have  been  the  first 
to  use  a  peak  of  tension  test  format.  This  was  done  by  showing  a  suspect  a  series  of 
photographs  of  children,  one  being  the  victim  of  sexual  assault.  If  the  suspect  did  not 
react  more  to  the  victims  picture  than  the  pictures  of  the  other  children,  Lombroso 
concluded  that  the  suspect  did  not  know  what  the  victim  looked  like  and  therefore  was  not 

the  alleged  perpetrator. 

In  1914  Vittorio  Benussi  published  his  research  on  predicting  deception  by  measuring 
recorded  respiration  tracings  [4],  He  found  that  if  the  length  of  inspiration  were  divide  by 
the  length  of  expiration,  the  ratio  would  be  larger  after  lying  than  before  lying  and  also 
before  telling  the  truth  than  after  telling  the  truth.  In  1921  John  A.  Larson  constructed  an 
instrument  capable  of  simultaneously  recording  blood  pressure  pulse  and  respiration 
during  an  examination  [3][4].  Larson  reported  accurate  results  which  prompted  Leonarde 
Keeler  to  construct  a  better  version  of  this  instrument  in  1926  [3]  [4], 


*  This  section  is  exerpted  from  [17] 
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The  use  of  galvanic  skin  response  in  lie  detection  began  during  the  turn  of  the  century.  It's 
usefulness,  however,  did  not  become  evident  until  the  1930's  during  which  time  several 
articles  written  by  Father  Walter  G.  Summers  of  Fordham  University  in  New  York  [4]. 

In  these  articles  he  reports  over  90  criminal  cases  in  which  examination  using  the  galvanic 
skin  response  had  all  been  successful  and  confirmed  by  confession  or  supplementary 
evidence.  The  usefulness  of  the  galvanic  skin  response  prompted  Keeler  to  add  an 
galvanometer  to  his  polygraph.  At  the  time  of  Keelers  death  in  1949,  the  Keeler 
Polygraph  recorded  blood  pressure-pulse,  respiration,  and  galvanic  skin  response  [3], 


II.2  Modem  Test  Formats 


The  effectiveness  of  a  polygraph  examination  is  often  the  result  of  the  test  format  that  is 
used.  A  polygraph  test  format  consists  of  an  ordered  combination  of  relevant  questions 
about  an  issue,  control  questions  that  provide  a  physical  response  for  comparison,  and 
irrelevant  questions  that  also  provide  a  response  or  the  lack  of  a  response  for  comparison 
[1][4],  Three  general  types  of  test  formats  are  in  use  today.  These  are  Control  Question 
Tests,  Relevant-Irrelevant  Tests,  and  Concealed  Knowledge  Tests.  Each  of  the  general 
test  formats  may  have  a  number  of  more  specific  variations.  Each  test  consists  of  two  to 
five  charts  containing  a  prescribed  series  of  questions.  The  test  format  that  is  used  in  an 
examination  is  determined  by  the  test  objective  [3] [4], 

The  concealed  knowledge  test,  also  called  peak  of  tension  test,  is  used  when  facts  about  a 
crime  are  known  only  by  the  investigators  and  not  by  the  public.  In  this  case,  a  subject 
would  not  know  the  facts  unless  he  or  she  was  guilty  of  the  crime.  For  example,  if  a  gun 
was  used  in  a  crime  and  the  public  did  not  know  the  caliber,  an  examiner  could  ask  a 
suspect  if  it  was  a  22  caliber ,  a  38  caliber,  or  a  9  mm.  If  the  gun  used  was  a  9  mm  and 
the  suspect  was  deceptive,  a  polygraph  chart  would  probably  indicate  evidence  of 
deception. 

A  control  question  test  is  often  used  in  criminal  investigations.  In  this  type  of  test  a  senes 
of  relevant,  irrelevant,  and  control  questions  are  asked.  A  relevant  question  is  one  which 
is  specific  to  the  crime  being  investigated.  For  example,  -  Did  you  steal  the  money?".  A 
control  question  is  designed  to  make  the  subject  feel  uncomfortable.  It  is  not  specific  to 
the  crime  being  investigated  however  it  may  be  related  in  an  indirect  way.  A  control 
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question  that  could  follow  the  relevant  question  stated  above  is  "Have  you  ever  taken 
anything  that  did  not  belong  to  you?".  The  control  questions  are  compared  to  the  relevant 
questions  and  if  the  responses  to  the  relevant  questions  are  greater,  the  subject  is  usually 
classified  as  deceptive.  Irrelevant  questions  are  used  as  buffers.  Examples  of  irrelevant 
questions  are  "Are  the  lights  in  this  room  on?"  or  "Is  today  Monday?". 

Relevant-Irrelevant  tests  are  usually  used  to  test  people  trying  to  obtain  security  clearance 
or  get  a  job.  In  this  test,  relevant  questions  are  compared  to  irrelevant  questions.  Very 
few  control  questions  are  asked.  The  purpose  of  control  questions  in  this  test  is  to  make 
sure  that  the  subject  is  capable  of  reacting  at  all. 


II.3  Present  Day  Equipment 


The  most  popular  polygraph  machines  today  are  the  Reid  Polygraph  developed  in  1945 
and  the  Axciton  Systems  computerized  polygraph  developed  in  1989  [1][1  !]•  The  Reid 
polygraph  scrolls  a  piece  of  paper  under  pens  that  record  the  biological  signals.  The 
Axciton  polygraph  digitizes  physiological  signals  and  uses  a  computer  to  process  them. 

The  sampling  frequency  of  the  Axciton  machine  is  30  Hz.  Axciton  provides  a  computer 
based  system  for  ranking  the  subject  responses  but  allows  printouts  of  the  charts  to  be 
scored  by  hand  the  traditional  way.  Both  machines  record  the  same  biological  signals 
using  standard  methods.  Blood  pressure  is  measured  by  placing  a  standard  blood  pressure 
cuff  on  the  arm  over  the  brachial  artery.  Respiration  is  monitored  by  placing  rubber  tubes 
around  the  abdominal  area  and  the  chest  of  the  subject.  This  results  in  two  signals,  an 
upper  and  lower  respiratory  signal.  Skin  conductivity  is  measured  by  placing  electrodes 

on  two  fingers  of  the  same  hand. 
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III.  Feature  Extraction  and  Classification 


III.1  Introduction 


The  problem  of  Classification  of  Polygraph  data  like  other  pattern  recognition  problems 
can  be  considered  of  consisting  of  several  main  stages.  Figure  [1]  shows  these  stages  and 
the  relationship  between  them.  At  the  beginning  data  is  preprocessed  so  that  noise  and 
redundancies  are  removed  from  data  and  feature  extraction  can  be  done  more  accurately. 
The  next  stage  is  feature  extraction.  In  this  step  data  is  read  and  appropriate  features  are 
extracted  from  it.  This  is  a  very  important  step  in  all  pattern  recognition  problems, 
because  the  purpose  of  pattern  recognition  is  finding  similarities  in  data  that  belong  to  the 
same  class,  and  features  are  elements  that  represent  these  similarities.  Therefore,  a  good 
set  of  features  can  lead  to  good  classification  whereas  a  satisfactory  result  cannot  be 
achieved  with  an  inappropriate  set  of  features.  Having  a  set  of  features,  the  next  step  is  to 
use  a  method  to  classify  data  using  these  features.  These  steps  as  applied  to  Polygraph 
classification  are  described  in  more  details  in  the  following  sections. 


POLYGRAPH  CLASSIFICATION 


Figure  1 


III.2.  Preprocessing 


Polygraph  data  consists  of  signals  from  four  different  channels:  galvanic  skin  response 
(GSR),  blood  pressure,  higher  respiration,  and  lower  respiration.  First  blood  pressure 
signal  was  decomposed  into  a  high  frequency  component  showing  heart  pulse,  and  a  low 
frequency  component  showing  blood  volume.  Derivative  of  the  blood  volume  channel 
was  taken  and  used  as  another  channel.  These  six  derived  signals  were  detrended  and 
filtered.  For  more  details  on  preprocessing  refer  to  [17]. 
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III.3.  Feature  Extraction 


In  this  step  appropriate  features  are  selected  and  extracted.  Feature  extraction  is  itself 
divided  into  several  steps.  Figure  [2]  shows  different  stages  involved  in  feature  extraction. 

By  feature  gathering  we  mean  selecting  features  that  might  have  useful  information  in 
them.  Feature  Combination  is  a  special  step  in  polygraph  classification.  In  this  step 
features  derived  for  different  questions  in  a  test  are  combined  to  build  a  single  feature, 
feature  selection  is  a  step  in  which  a  small  number  of  features  is  selected  from  the  main 
feature  set  to  be  used  in  final  classifier  section. 


Figure  2 
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III.3.1.  Feature  Gathering 


Features  that  possibly  convey  some  information  in  them  were  selected  and  extracted  in  this 
stage.  Literature  about  Polygraph  were  studied  and  several  Polygraph  examiners  were 
interviewed  to  find  out  what  had  been  done  about  this  problem  and  what  characteristics  in 
a  signal  are  used  as  indicators  of  truth  or  deception.  In  general  features  are  divided  into 
three  main  groups,  time  domain  features,  frequency  domain  features  and  correlation 
features.  Time  domain  features  are  mostly  standard  characteristics  like  mean,  standard 
deviation,  median  and  so  on.  Some  more  specific  time  domain  features  were  also  added, 
such  as  the  ratio  between  inhalation  and  exhalation.  Auto  Regressive  parameters  were 
also  extracted  and  tried  as  features.  To  extract  each  feature  for  each  question  a  time 
frame  was  considered  that  started  with  a  specific  delay  after  each  question  was  asked  and 
lasted  for  a  specific  amount  of  time.  Different  time  frames  were  used  for  different 
channels  because  each  channel  represents  a  different  physiological  parameter.  Frequency 
domain  features  include  fundamental  frequency,  magnitude  of  power  spectral  density  at 
fundamental  frequency,  coherency  at  fundamental  frequency  and  so  on.  Figure  3  shows 
the  feature  gathering  and  the  decisions  that  involved  in  this  step. 


Figure.  3 
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For  every  question  in  a  test  93  features  were  selected  and  extracted  .  Also  6  Integrated 
Spectral  Density  features  were  used  which  directly  compare  each  relevant  question  to  the 
nearest  control  question.  The  total  number  of  features  derived  for  each  test  was  : 

93x10+6x5  =  960 

This  was  repeated  for  all  the  tests  in  feature  sets  1,  2  and  3.  The  results  of  each  set  were 
saved  in  a  960x 100  matrix  called  the  M  matrix. 

For  a  detailed  description  of  time  domain  features  and  frequency  domain  features  refer 
respectively  to  [17]  and  [16]. 


III.3.2.  Feature  Combination 


As  mentioned  earlier  each  feature  is  extracted  for  all  questions  in  a  test,  that  is  for 
relevant,  irrelevant,  and  control  questions.  In  a  polygraph  test  responses  to  relevant 
questions  are  compared  to  responses  to  irrelevant  and  control  questions.  But  in  any  test 
there  are  several  questions  of  each  type  and  many  methods  can  be  used  to  combine  them. 
Figure  [4]  shows  different  methods  to  combine  the  features.  It  was  decided  not  to  use 
irrelevant  questions  in  this  study,  because  in  a  Controlled  Question  Polygraph  Test 
comparison  between  the  responses  to  relevant  and  control  questions  is  the  most  important 
factor.  For  most  of  the  features  seven  methods  were  tried  to  combine  features  of  different 
questions  in  a  test.  For  the  last  six  features  three  ways  to  combine  them  were  tried.  These 
methods  were  finding  the  average,  maximum  and  minimum  of  relevant-control  pairs.  The 
first  93  features  combined  in  seven  ways  and  six  integrated  spectral  density  features  were 
combined  in  three  ways  so  the  total  number  of  features  at  this  stage  was  equal  to: 

(93x7; +(6x3;  =  669 
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Figure  4 


m.3.3  Feature  Selection 

Feature  selection  was  done  in  two  independent  steps,  reduction  and  combination.  Figure 
[5]  shows  the  relationship  of  these  two  steps.  These  two  steps  are  explained  m  the 

following  two  sections. 


Figure.  5 
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BH.3.3.1  Feature  Selection  (Reduction) 


The  next  step  in  our  Feature  Extraction  was  to  reduce  the  number  of  features  to  a  number 
so  that  a  practical  algorithm  can  be  used  to  select  the  feature  set  from  them.  It  was 
decided  to  bring  down  the  number  of  features  from  669  to  30  at  this  step.  Two  different 
methods  were  chosen  to  test  the  features  one  at  time  to  find  the  best  30.  The  first  method 
was  using  the  KNN  classifier  to  classify  the  data  files  using  one  feature  at  a  time.  It  was 
decided  to  use  a  Fuzzy  version  of  K-nearest  neighbor  algorithm.  The  value  5  was  selected 
for  the  K  because  it  seemed  that  it  gave  better  results  than  the  other  values  for  1  feature 
classification .  Also  a  threshold  of  0.5  was  used  to  defuzzify  the  output  of  the  classifier. 
Refer  to  the  section  on  classification  for  the  reason  of  choosing  this  classifier.  The  second 
method  was  using  the  scatter  criterion  is  given  below. 

(1) 

mi  -  mean  of  class  i,  5,  =  standard  deviation  of  class  i 

This  criterion  measures  the  distance  between  the  means  of  the  two  classes,  normalized 
over  the  sum  of  the  variances.  Therefore  the  more  compactly  the  samples  in  each  class  re 
separated,  the  higher  will  be  the  value  of  J. 

The  two  methods  were  run  on  three  sets  of  data.  At  this  point  a  method  was  needed  to 
choose  the  features.  Different  methods  are  possible  for  this  step.  The  method  that  was 
followed  is  shown  in  figure  [6]  and  explained  below. 

At  first  the  results  of  KNN  and  scatter  criterions  were  averaged  for  3  sets  of  data  so  that 
features  that  work  well  for  all  data  sets  would  be  selected.  As  mentioned  in  an  earlier 
section  for  Basic  features  1  to  93,  7  features  and  for  the  features  94  to  99,  3  features  were 
derived.  Because  these  features  are  derived  from  one  basic  feature  and  are  strongly 
correlated,  it  was  decided  to  choose  only  one  from  them.  So  the  best  feature  from  these 
sets  of  3  or  7  was  selected,  and  the  results  were  sorted. 
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Two  sets  of  30  features  were  found  using  the  above  mentioned  criterions.  The  next  step 
was  choosing  30  features  from  these  60.  This  was  done  by  examining  the  tables  and 
selecting  the  features  that  showed  a  good  performance  in  both  cases  or  had  a  special 
physical  meaning. 

This  set  of  features  is  the  final  set  used  for  examining  and  selection.  Table  1  in  Appendix 
A  shows  these  features  with  their  corresponding  meaning,  channel  used  to  derive  the 
feature,  and  the  method  to  combine  the  features  for  different  questions. 


m.3.3.2  Feature  Selection  (Combination) 


The  number  of  features  was  reduced  to  30  in  the  Feature  Reduction  step.  This  number 
should  be  further  reduced  because  there  is  100  samples  in  each  data  file,  and  using  30 
features  in  a  classifier  might  give  very  good  results  for  that  particular  data  set,  but  it  won't 
be  able  to  generalize.  At  this  step  measuring  the  performance  of  individual  features  is  not 
a  very  logical  method.  Because  for  example  features  'A'  and  "B’  might  be  good  features 
individually,  but  combining  them  might  not  necessarily  give  better  results.  Whereas 
feature  'C'  that  might  not  be  a  very  good  feature  by  itself  might  improve  the  classification 

if  combined  with  feature  'A'. 

Therefore  the  combinations  of  the  features  should  be  examined.  Many  methods  are 
suggested  to  solve  this  problem.  The  most  basic  way  is  exhaustive  search.  That  is  trying 
all  the  combinations  for  these  features.  It  is  obvious  that  this  is  not  practical  when  the 
number  of  features  is  not  very  small.  For  example  choosing  10  or  less  features  from  a  set 
of  30  and  trying  all  the  different  combinations  needs 


10  10  OA/ 


=  108 


computations. 

The  method  that  was  chosen  was  to  start  with  all  the  combinations  of  two,  find  the  best  N 
ones  among  them,  and  use  only  these  combinations  to  combine  features  in  sets  of  3.  Then 
again  find  the  best  combinations  of  3  and  use  them  in  combinations  of  4  features. 

This  procedure  is  continued  until  satisfactory  results  are  gained  or  features  are  not 
improved  by  increasing  the  number  of  features.  Figure  [7]  shows  the  algorithm  for  this 

step. 
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All  pairwise  combinations  of  the  features  were  tried  to  see  the  classification  results.  The 
classifier  used  was  Fuzzy  K-nearest  neighbor  with  a  threshold  of  0.5,  and  K— 5.  This  was 
done  for  three  sets  of  features.  The  results  were  sorted  and  30  best  combinations  for  each 
set  were  found.  Also  the  results  of  classification  for  each  combination  for  the  3  sets  was 
averaged  and  the  30  combinations  that  gave  best  results  on  the  average  were  found. 

These  combinations  are  shown  in  Table  2  in  Appendix  A. 

It  was  decided  to  select  20  sets  of  pairwise  combinations  to  use  in  combinations  of  3. 
Results  for  sets  1-3  and  Average  were  studied  and  combinations  that  showed  a  good 
result  in  one  of  the  sets  or  had  a  good  average  were  selected.  Table  3  in  Appendix  A 
shows  these  combinations. 

The  same  steps  were  repeated  to  study  the  combinations  of  3  and  4  features.  The  results 
are  shown  in  Tables  4  and  6  in  Appendix  A.  Because  of  time  limitations  it  was  decided 
not  to  go  further  from  combinations  of  4  features. 
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m.3.4  Discussion  about  the  results: 


The  classification  results  improved  consistently  by  increasing  the  number  of  features  from 
one  to  four.  The  features  that  showed  the  best  result  for  the  three  sets  were  features  {5, 

9,  21,  23} with  81  percent  correct  classification.  These  features  represent  Maximum  Of 
GSR,  Difference  between  Maximum  and  Minimum  of  High  Cardio,  Maximum  of  Lower 
Respiratory,  and  the  Difference  between  Maximum  and  Minimum  of  Upper  Respiratory. 
These  features  show  approximately  the  same  classification  results  for  all  three  sets  which 

is  81  percent. 

Other  combinations  of  features  also  gave  comparable  results.  For  example  {5,  21,  23,  29} 
and  {5, 11, 21,  23},  and  (5, 10,21,23}.  Note  the  repetition  of  {5,  21,  23}.  Refer  to  the 
table  1  in  Appendix  A  for  a  meaningful  listing  of  the  features.  It  is  very  notable  that 
feature  sets  that  show  the  best  classification  results  has  features  that  come  from  different 
channels.  It  can  be  concluded  that  signals  from  different  physiological  channels  convey 
independent  information,  so  that  using  features  extracted  from  them  improves  the 
classification. 

Another  point  to  notice  is  that  data  set  three  shows  better  classification  results  than  the 
two  other  sets,  87  percent  versus  81  percent  for  the  sets  one  and  two.  The  feature  set  that 
gives  the  best  result  for  data  set  three  is  (9, 14,  19,  24}.  This  feature  set  gives  87.4 
percent  correct  classification  for  data  set  three.  The  feature  set  (5,  9,  21,  23}  that  gives 
the  best  classification  on  the  average,  has  approximately  the  same  results  for  all  three  sets, 
81  percent.  The  polygraph  tests  that  were  used  in  this  project  came  from  several  sources 
and  were  done  by  different  examiners  that  used  slightly  different  methods.  Fifty 
consecutive  tests  were  used  to  build  each  data  set.  So  it  is  possible  that  some 
characteristic  exists  in  the  deceptive  files  of  data  set  three  that  results  in  better 
classification.  This  is  a  matter  of  future  investigation. 


3-19 


III.4.  Classification 


The  classifier  is  the  final  stage  in  a  pattern  recognition  system.  The  inputs  to  the  classifier 
are  usually  a  set  of  feature  vectors.  The  classifier  ordinarily  assigns  each  input  to  one  of 
the  classes.  There  are  many  methods  to  design  a  classifier.  The  classifier  could  be 
designed  after  studying  the  distribution  of  samples  of  each  class,  or  a  learning 
classification  algorithm  can  be  implemented.  We  were  not  sure  about  the  shape  of 
clustering  and  the  distribution  of  samples  for  deceptive  and  non  deceptive  classes,  and  it 
was  possible  that  samples  for  one  class  cluster  around  more  than  one  point  in  space.  It 
was  decided  to  use  the  K-nearest  neighbor  classifier*  in  this  project  because  it  does  not 
explicitly  use  the  distribution  of  the  samples. 

One  of  the  characteristics  of  the  conventional  classification  methods  is  that  they  assign 
each  input  to  one  of  the  possible  classes  (crisp  Classification)  or  find  probability 
distributions  of  belongingnesses  of  the  inputs  to  the  classes.  While  the  way  that  humans 
think  and  classify  objects  is  fundamentally  different.  Each  object  can  be  considered  to 
belong  to  more  than  one  class  at  the  same  time,  and  there  are  degrees  of  belongingness  for 
each  class.  This  is  the  basic  idea  that  is  followed  in  Fuzzy  Logic.  It  was  decided  to  follow 
a  Fuzzy  Logic  based  classifier  in  this  project,  because  the  output  will  be  the  possibility  of 
deception  and  a  person  will  not  be  considered  completely  deceptive  or  non  deceptive. 

Conventional  K-nearest  neighbor  algorithm  and  a  Fuzzy  version  of  it  are  described  in  the 
following  two  sections. 


*  We  are  indebted  to  Professor  R.  Duda  for  suggesting  KNN  classifier. 
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m.4.1.  K-Nearest  Neighbor  Algorithm 


K-Nearest  neighbor  algorithm  is  a  supervised  classification  method.  There  is  no  need  for 
the  training  or  adjusting  the  classifier.  A  set  of  labeled  input  samples  is  given  to  the 
classifier.  When  a  new  sample  is  given  to  the  system,  it  finds  its  K  nearest  neighboring 
samples,  and  assigns  this  sample  to  the  class  that  the  majority  of  the  neighbors  belong  to. 

K  could  be  any  positive  integer.  When  K  is  set  to  1,  the  algorithm  is  called  the  nearest 
neighbor  algorithm.  In  this  case  each  new  sample  is  assigned  to  the  class  of  its  nearest 
neighbor.  If  K  is  greater  than  1,  it  is  possible  that  there  is  no  majority  class.  To  remove 
this  tie,  the  sum  of  the  distances  of  the  new  sample  to  its  neighbors  in  each  class  is 
computed  and  the  sample  is  assigned  to  the  class  that  has  the  minimum  distance.  The 
main  advantage  of  using  this  method  is  that  the  samples  of  each  class  are  not  needed  to 
cluster  in  a  pre  specified  shape.  For  example  for  a  two  class  classification,  the  K-nearest 
neighbor  classifier  can  still  give  very  good  results  if  the  samples  of  each  class  are  clustered 
in  two  distinct  points  in  the  space.  The  algorithm  for  the  K  nearest  neighbor  is  shown  in 
figure  8.  It  is  supposed  that  C  is  the  number  of  classes,  K  is  the  number  of  neighbors  in 
KNN,  x,.  x,  is  the  i th  labeled  sample  and  y  is  the  input  to  be  classified. 
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Figure  8.  K  Nearest  Neighbor  Algorithm 
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III.4.2.  Fuzzy  K  Nearest  Neighbor  Algorithm 


The  fuzzy  K  nearest  neighbor  algorithm  uses  the  same  idea  of  conventional  K  nearest 
neighbor  algorithm,  that  is  finding  the  K  samples  that  are  closest  to  sample  to  be  classified. 
But  there  is  a  conceptual  difference  in  classification.  When  fuzzy  classification  is  used,  the 
input  is  not  assigned  to  a  single  class.  Instead,  the  degree  of  belongingness  of  the  input  to 
each  class  is  determined  by  the  classifier.  By  using  this  method  more  information  is 
obtained  about  the  input.  For  example  if  the  result  of  classification  determines 
membership  of  an  input  to  class  A  is  0.9  and  to  class  B  is  0.1,  it  means  the  input  belongs 
to  class  A  with  a  very  good  possibility.  But  if  the  membership  to  class  A  is  0.55  and  to 
class  B  is  0.45,  it  means  that  we  cannot  be  very  sure  about  the  classification  of  the  input. 

If  the  crisp  classifier  is  used,  in  both  cases  the  input  will  be  assigned  to  class  A  and  no 
further  information  is  obtained. 

Refer  to  [14,  15]  for  more  detailed  discussions  about  fuzzy  K  nearest  neighbor  algorithms. 
The  flowchart  for  a  fuzzy  K  nearest  neighbor  classifier  is  drawn  in  figure  9. 

The  first  step  in  the  fuzzy  K  nearest  neighbor  algorithm  is  the  same  as  first  step  in  crisp 
classifier.  In  both  cases  K  nearest  neighbors  of  the  input  are  found.  While  m  crisp 
classifier  the  majority  class  of  the  neighbors  is  assigned  to  the  input,  in  Fuzzy  classifier 
membership  of  the  input  to  each  class  should  be  found.  In  order  to  do  so  the  membership 
vector  of  each  sample  is  combined  to  obtain  the  membership  vector  of  the  input.  If  the 
samples  are  crisply  classified,  membership  vectors  should  be  assigned  to  them.  One 
method  to  do  so  is  to  assign  the  membership  of  1  to  the  class  that  it  belongs  to,  and 
membership  of  0  to  other  classes.  Other  methods  assign  different  memberships  to  the 
samples  according  to  its  distance  from  the  mean  of  the  class,  or  the  distances  from  the 
nearby  samples  of  its  own  class  and  the  other  classes. 


When  the  membership  vectors  of  the  labeled  samples  are  specified,  they  are  combined  to 
find  the  membership  vector  of  the  unknown  class.  This  procedure  should  be  done  in  a 
way  that  samples  that  are  closer  to  the  input  have  more  effect  on  the  resultant  membership 
function.  The  following  formula  uses  the  inverse  distance  to  weigh  the  membership 
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functions,  x  is  the  input  to  be  classified,  x;.  is  the  j  th  nearest  neighbor  and  ufj  is  the 
membership  of  the  j  th  nearest  neighbor  of  the  input  in  class  i.  D(x,y)  is  a  distance  measure 
between  the  vectors  x  and  y  which  could  be  the  Euclidean  distance. 

Dfx.x^) 

=  - jT~ 

/D(x,Xj)m-') 

m  is  a  parameter  that  changes  the  weighing  effect  of  the  distance.  When  m  » 1,  all  the 
samples  will  have  the  same  weight.  When  m  approaches  1,  the  nearest  samples  have  much 
more  effect  on  the  membership  value  of  the  input. 
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Set  first  K  samples 
as  K  nearest  neighbors 


End 


Yu,J(l/D(x,xj)^) 

Ui(  x)  =  ^ - X- 

2n /d(x,xj)~') 


Figure  9.  Fuzzy  K-Nearest  Neighbor 
Algorithm 
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III.4.3.  Methods  and  Discussion: 


As  mentioned  in  an  earlier  section  the  classifier  was  needed  to  compare  the  effectiveness 
of  single  features  and  to  choose  the  combinations  of  the  features  that  gave  the  best 
classification  results.  Therefore,  the  classifier  was  selected  and  used  before  the  final 
feature  set  was  determined.  The  classifier  might  change  the  results  of  the  classification 
and  finding  the  best  classifier  is  not  a  trivial  task.  For  example  using  the  value  of  10  for  K 
may  change  the  set  of  30  best  features  that  was  found  by  using  K  =  5. 

It  is  not  practical  to  try  all  different  cases  for  different  classifiers  and  different  parameters 
of  classifiers,  so  it  was  decided  to  use  a  classifier  with  fixed  parameters  up  to  the  point 
that  final  set  of  features  were  selected.  The  classifier  as  mentioned  earlier  was  a  Fuzzy  K- 
nearest  neighbor  with  the  following  parameters: 

K  =  5, 

m  =  2, 

Defuzzification  threshold  =  0.5; 

It  should  be  noted  that  in  order  to  save  computation  time  throughout  this  project,  each  set 
of  files  was  randomly  broken  into  a  training  and  a  testing  set.  Each  file  in  the  testing  set 
was  classified  using  the  labeled  files  in  training  set.  Each  experiment  was  repeated  20 
times,  and  the  results  were  averaged.  The  number  of  files  that  were  used  for  training  and 
testing  were  accordingly  75  and  25.  In  the  last  stage  of  experiments  after  the  final  feature 
set  had  been  fixed,  instead  of  randomly  selecting  testing  and  training  files,  one  file  was 
kept  for  testing  each  time  and  the  experiment  was  repeated  100  times  changing  the  test 
file. 

After  the  final  feature  set  was  selected  (Refer  to  the  section  on  Feature  Extraction), 
different  values  for  K  were  tried  on  fuzzy  and  crisp  classifier  to  compare  the  two 
classifiers  and  find  the  best  parameters.  In  addition  to  percentage  of  correct  classification 
a  measure  of  performance  was  also  used  which  is  explained  below. 

The  measure  that  is  used  to  compare  the  performance  of  fuzzy  classifier  is  the  root  mean 
square  of  the  distances  between  the  output  of  the  classifier  and  the  correct  class.  The 
correct  ouput  of  the  classifer  should  be  0  for  non-deceptive  cases  and  1  for  the  deceptive 
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ones.  For  example  if  for  a  deceptive  sample  the  classifier  output  is  0.8,  0.2  is  the  distance 
between  the  output  and  the  correct  class.  The  same  measure  is  used  for  the  crisp 
classifier.  In  the  case  of  the  crisp  classifier  the  distance  is  always  0  for  correct 
classification  and  1  for  incorrect  classification. 

For  the  fuzzy  classifier  the  threshold  used  for  defuzzification  was  also  changed  to  find  the 
optimum  value.  Tables  7  and  8  in  Appendix  A  show  the  results.  The  best  classification  on 
the  average  over  three  sets  is  obtained  using  the  fuzzy  classifier  with  K  =  6,  and  threshold 
=  0.6  Using  this  values  correct  classification  of  8 1 .6  percent  was  achieved.  The  best 
result  using  the  crisp  classifier  was  80.6  percent  which  was  obtained  using  K=6.  The 
performance  measures  for  the  fuzzy  and  crisp  classifiers  were  accordingly  0.3915  and 
0.4377  which  shows  fuzzy  classifier  has  a  better  performance  in  this  respect. 

One  final  experiment  that  was  done  is  explained  below.  In  a  Polygraph  examination  a  set 
of  questions  is  repeated  one  to  five  times  and  the  decision  is  made  by  considering  the 
responses  to  all  these  charts.  In  this  project  each  chart  was  classified  separately.  As  the 
final  experiment  responses  to  all  the  charts  in  a  Polygraph  examination  were  combined  and 
classified  as  deceptive  or  non-deceptive.  The  way  they  were  combined  was  finding  the 
majority  class  and  assigning  the  case  to  that  class.  In  the  case  that  equal  number  of  files 
classified  as  deceptive  and  non-deceptive,  the  membership  function  of  the  files  was 
averaged  and  the  case  was  classified  according  to  this  value.  The  classification  results  for 
all  the  files  in  sets  1  to  3  are  shown  in  Table  9  in  Appendix  A.  The  number  of  cases  in 
each  set  was  35.  The  number  of  misclassified  cases  in  sets  1  to  3  are  5,  7,  and  3,  which 
correspond  to  correct  classifications  of  85.7,  80.0,  and  91.4  percent. 
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IV.  Conclusion  and  future  work 


The  set  of  four  features  that  showed  best  classification  results  in  this  project  were 
Maximum  of  GSR,  Upper  Respiration  and  Lower  respiration  signals,  and  the  difference 
between  the  Maximum  and  Minimum  of  High  Cardio  signal.  These  are  all  very  simple 
time  domain  features.  The  best  classification  was  obtained  using  the  fuzzy  classifier  with 
K  =  6,  and  threshold  =  0.6  .  Using  this  values  correct  classification  of  81.6  percent  was 
achieved.  By  combining  all  the  files  in  a  Polygraph  examination  85.7  percent  correct 
classification  was  achieved  on  the  average. 

There  are  several  suggestions  for  the  future  work.  First  is  to  repeat  this  work  with  larger 
sets  of  data  files  and  observe  the  generalizability  of  the  feature  sets  obtained  in  this 
research.  A  possible  way  to  improve  the  results  is  to  change  time  frames  used  to  extract 
each  feature  for  every  question.  In  this  way  the  optimum  time  for  obtaining  a  response 
could  be  found.  Another  suggestion  is  to  try  different  methods  for  fuzzification  and 
defuzzification  of  feature  vectors  to  optimize  the  fuzzy  classifier. 
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Appendices 


Appendix  A: 
Tables 


No.  feature 


Description 


Channel 


Method 


1 


lOmean 


mean 


GSR 


lOcurve 


curve  length 


GSR 


1 


lOmed  dif 


median  of  the  derivative 


GSR 


lOmax  min 


minimum  subtracted  from  the  maximum 


GSR 


lOmax 


maximum  of  the  signal 


GSR 


lOmdif 


mean  of  derivative 


GSR 


20curve 


curve  length 


High  Cardio 


20ampcard 


amplitude  of  the  peaks 


High  Cardio 


20max  min 


minimum  subtracted  from  the  maximum 


High  Cardio 


10 


11 


20max 


20min 


maximum  of  the  signal 


High  Cardio 


minimum  of  the  signal 


High  Cardio 


12 


13 


14 


30med  dif 


30max 


40mean 


median  of  the  derivative 


Low  Cardio 


maximum  of  the  signal 


Low  Cardio 


mean 


Derivative  of  Low  Cardio 


15 


16 


17 


40max 


50curve 


50ampr 


maximum  of  the  signal 


Derivative  of  Low  Cardio 


curve  length 


Lower  Respiratory 


amplitude  of  the  peaks 


Lower  Respiratory 


18 


19 


50peaknumr 


50ie 


number  of  the  peaks 


Lower  Respiratory 


inhalation  divided  by  exhalation 


Lower  Respiratory 


20 


50max  min 


minimum  subtracted  from  the  maximum 


Lower  Respiratory 


21 


22 


23 


50max 


60max  min 


60max 


maximum  of  the  signal 


Lower  Respiratory 


minimum  subtracted  from  the  maximum 


Upper  Respiratory 


maximum 


Upper  Respiratory 


24 


25 


lOstd 


20std 


standard  deviation 


GSR 


standard  deviation 


High  Cardio 


26 


50std 


standard  deviation 


Upper  Respiratory 


27 


20armodl 


auto  regressive  parameter 


High  Cardio 


28 


29 


30 


26psdcohl 


lOisdl 


20isdl 


max  cross  spectral  density 


frequency  of  maximum  integrated  spectral 
difference  of  control-relevant  pair 


High  Cardio,  Lower  Respiratory 


GSR 


area  under  integrated  spectral  difference 


High  Cardio 


3* 


Methods:  l=Difference  of  Averages,  2=Normalized  Average,  3=Max-Max,  4=Min-Min, 
5=Max-Min,  6=Min-Max,  7=Max/Min ,  l*=Average  of  relevant-control  pairs,  3*=Max  of  relevant- 
control  pair. 


Table  1.  Selected  Features 
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Percentage  of  correct  classification  for  30  best  combinations  in  set  1 


Percent  correct 

Feature  1 

Feature  2 

74.2000 

8.0000 

18.0000 

74.0000 

10.0000 

21.0000 

73.0000 

5.0000 

7.0000 

72.0000 

24.0000 

26.0000 

71.8000 

23.0000 

24.0000 

71.6000 

4.0000 

26.0000 

70.4000 

25.0000 

26.0000 

70.4000 

18.0000 

25.0000 

70.2000 

24.0000 

27.0000 

70.2000 

9.0000 

21.0000 

70.0000 

5.0000 

27.0000 

69.6000 

11.0000 

21.0000 

69.6000 

9.0000 

24.0000 

69.4000 

11.0000 

27.0000 

69.4000 

5.0000 

26.0000 

69.2000 

8.0000 

19.0000 

69.2000 

5.0000 

18.0000 

69.0000 

25.0000 

27.0000 

69.0000 

9.0000 

18.0000 

69.0000 

5.0000 

23.0000 

68.8000 

24.0000 

30.0000 

68.8000 

18.0000 

20.0000 

68.8000 

17.0000 

20.0000 

68.8000 

4.0000 

15.0000 

68.6000 

22.0000 

24.0000 

68.4000 

6.0000 

24.0000 

68.4000 

1.0000 

27.0000 

68.2000 

15.0000 

24.0000 

68.2000 

9.0000 

26.0000 

68.2000 

5.0000 

19.0000 

Table  [2.1]  Results  of  pairwise  combinations  of  features 


Percentage  of  correct  classification  for  30  best  combinations  in  set  2 


Percent  correct 

Feature  1 

Feature  2 

74.4000 

5.0000 

23.0000 

74.4000 

4.0000 

27.0000 

74.2000 

4.0000 

15.0000 

74.0000 

20.0000 

24.0000 

73.6000 

16.0000 

24.0000 

73.2000 

3.0000 

27.0000 

72.8000 

27.0000 

30.0000 

72.6000 

4.0000 

30.0000 

72.6000 

4.0000 

7.0000 

72.4000 

5.0000 

25.0000 

72.2000 

24.0000 

30.0000 

72.2000 

8.0000 

27.0000 

72.2000 

4.0000 

17.0000 

72.2000 

4.0000 

16.0000 

72.0000 

24.0000 

27.0000 

72.0000 

24.0000 

25.0000 

72.0000 

4.0000 

20.0000 

71.8000 

7.0000 

23.0000 

71.8000 

4.0000 

10.0000 

71.2000 

25.0000 

27.0000 

70.8000 

24.0000 

26.0000 

70.8000 

8.0000 

22.0000 

70.6000 

7.0000 

27.0000 

70.6000 

6.0000 

27.0000 

70.4000 

14.0000 

21.0000 

70.4000 

14.0000 

20.0000 

70.4000 

4.0000 

8.0000 

70.2000 

4.0000 

24.0000 

70.0000 

22.0000 

27.0000 

70.0000 

17.0000 

24.0000 

Table  [2.2]  Results  of  pairwise  combinations  of  features 


Percentage  of  correct  classification  for  30  best  combinations  in  set  3 


Percent  correct 

Feature  1 

Feature  2 

81.0000 

1.0000 

10.0000 

80.6000 

9.0000 

24.0000 

80.4000 

10.0000  T 

24.0000 

80.4000 

4.0000 

25.0000 

80.2000  1 

4.0000 

9.0000 

79.8000 

5.0000 

11.0000 

79.2000 

17.0000 

24.0000 

79.2000 

1.0000 

21.0000 

79.2000 

1.0000 

8.0000 

79.0000 

1.0000 

24.0000 

79.0000 

1.0000 

11.0000  i 

78.8000 

4.0000 

11.0000 

78.6000 

4.0000 

17.0000 

78.2000 

24.0000 

25.0000 

78.2000 

1.0000 

14.0000 

78.0000 

1.0000 

23.0000 

78.0000 

1.0000 

20.0000 

77.8000 

23.0000 

24.0000 

77.8000 

1.0000 

5.0000 

77.6000 

19.0000 

24.0000 

77.4000 

11.0000 

24.0000 

77.4000 

5.0000 

18.0000 

77.2000 

4.0000 

19.0000 

77.0000 

4.0000 

18.0000  | 

76.8000 

4.0000 

15.0000 

76.6000 

5.0000 

13.0000 

76.6000 

4.0000 

24.0000 

76.2000 

4.0000 

5.0000 

76.2000 

1.0000 

26.0000 

Table  [2.3]  Results  of  pairwise  combinations  of  features 


Percentage  of  correct  classification  for  30  best  combinations  in  average 


Percent  correct 

Feature  1 

Feature  2 

73.2667 

4.0000 

15.0000 

72.8000 

24.0000 

26.0000 

72.6667 

4.0000 

17.0000 

72.6000 

5.0000 

23.0000 

72.2667 

23.0000 

24.0000  | 

72.0667 

24.0000 

30.0000 

71.9333 

20.0000 

24.0000 

71.8667 

24.0000 

27.0000 

71.4667 

24.0000 

25.0000 

71.4000 

4.0000 

26.0000 

71.0667 

4.0000 

10.0000 

70.9333 

1.0000 

8.0000 

70.9333 

4.0000 

23.0000 

70.6000 

5.0000 

11.0000 

70.6000 

4.0000 

24.0000 

70.5333 

9.0000 

24.0000 

70.4667 

6.0000 

24.0000 

70.4667 

4.0000 

25.0000 

70.4667 

4.0000 

19.0000 

70.4000 

4.0000 

30.0000 

70.3333 

1.0000 

23.0000 

70.0667 

17.0000 

24.0000 

70.0667 

1.0000 

24.0000 

70.0000 

16.0000 

24.0000 

69.9333 

4.0000 

9.0000 

69.8667 

4.0000 

20.0000 

69.8667 

5.0000 

7.0000 

69.8667 

4.0000 

7.0000 

69.8000 

15.0000 

24.0000 

69.8000 

1.0000 

21.0000 

Table  [2.4]  Results  of  pairwise  combinations  of  features 


CS  <N 


Percentage  of  correct  classification  for  30  best  combinations  in  set  1 


Percent  correct 

Feature  1 

Feature  2 

Feature  3 

79.4000 

10.0000  i 

21.0000 

26.0000 

77.6000 

5.0000 

7.0000 

23.0000 

77.6000 

5.0000 

23.0000 

11.0000 

77.4000 

5.0000 

23.0000 

21.0000 

76.4000 

16.0000 

24.0000 

18.0000 

76.4000 

5.0000 

23.0000 

19.0000 

75.8000 

23.0000 

24.0000 

19.0000 

75.8000 

23.0000 

24.0000 

15.0000 

75.8000 

5.0000 

23.0000 

7.0000 

75.6000 

5.0000 

7.0000 

22.0000 

75.6000 

5.0000 

7.0000 

21.0000 

75.6000 

5.0000 

7.0000 

16.0000 

75.4000 

5.0000 

7.0000 

14.0000 

75.4000 

5.0000 

11.0000 

10.0000 

75.2000 

10.0000 

21.0000 

19.0000 

75.2000 

8.0000 

18.0000 

6.0000 

75.2000 

5.0000 

23.0000 

2.0000 

75.0000 

10.0000 

21.0000 

16.0000 

75.0000 

10.0000 

21.0000 

8.0000 

75.0000 

5.0000 

11.0000 

18.0000 

75.0000 

4.0000 

26.0000 

14.0000 

75.0000 

5.0000 

23.0000 

29.0000 

75.0000 

5.0000 

23.0000 

25.0000 

74.8000 

10.0000 

21.0000 

9.0000 

74.6000 

10.0000 

21.0000 

12.0000 

74.6000 

5.0000 

11.0000 

23.0000 

74.6000 

10.0000 

24.0000 

9.0000 

74.6000 

5.0000 

23.0000 

10.0000 

74.6000 

5.0000 

23.0000 

9.0000 

74.4000 

5.0000 

7.0000 

19.0000 

Table  [4.1]  Results  of  combinations  of  3  features 


Percentage  of  correct  classification  for  30  best  combinations  in  set  2 


Percent  correct 

Feature  1 

Feature  2 

Feature  3 

79.8000 

20.0000 

24.0000 

12.0000 

78.6000 

24.0000 

30.0000 

19.0000 

78.6000 

4.0000  ’ 

15.0000 

28.0000 

78.0000 

24.0000 

27.0000 

19.0000 

77.8000 

4.0000 

17.0000 

19.0000 

77.6000 

8.0000 

18.0000 

4.0000 

77.4000 

4.0000 

27.0000 

19.0000 

77.4000 

5.0000 

23.0000 

21.0000  ! 

77.2000 

5.0000 

23.0000  1 

29.0000 

77.2000 

4.0000  | 

15.0000 

27.0000 

77.0000 

4.0000  1 

27.0000 

18.0000  1 

77.0000 

4.0000 

15.0000  I 

21.0000 

76.6000 

5.0000 

7.0000 

23.0000 

76.6000  _1 

20.0000 

24.0000 

3.0000 

76.4000 

16.0000 

24.0000  1 

30.0000 

76.4000 

4.0000 

27.0000 

25.0000 

76.4000 

24.0000 

27.0000 

10.0000 

76.4000 

23.0000 

24.0000 

30.0000 

76.2000 

5.0000 

23.0000 

3.0000 

76.2000 

4.0000 

17.0000 

2.0000 

76.2000 

4.0000 

15.0000 

26.0000 

75.8000 

5.0000 

7.0000 

15.0000 

75.8000 

24.0000 

30.0000 

4.0000  ] 

75.8000 

5.0000 

23.0000 

28.0000 

75.6000 

4.0000 

27.0000 

15.0000 

75.6000 

24.0000 

27.0000 

26.0000 

75.6000 

24.0000 

27.0000 

1.0000 

75.6000 

20.0000 

24.0000 

25.0000 

75.6000 

24.0000 

30.0000 

16.0000 

75.4000 

4.0000 

15.0000 

8.0000 

Table  [4.2]  Results  of  combinations  of  3  features 


Percentage  of  correct  classification  for  30  best  combinations  in  set  3 


Table  [4.3]  Results  of  combinations  of  3  features 


Percentage  of  correct  classification  for  30  best  combinations  on  average 


Percent  correct 

Feature  1 

Feature  2 

Feature  3 

78.2000 

5.0000 

23.0000 

29.0000 

77.6000 

5.0000 

7.0000 

23.0000 

77.3333 

5.0000 

23.0000 

21.0000 

76.6000 

5.0000 

23.0000 

10.0000 

76.0000 

23.0000 

24.0000 

15.0000  1 

75.8667 

5.0000 

7.0000 

21.0000 

75.8667 

5.0000 

23.0000 

7.0000 

75.6667 

5.0000 

23.0000 

11.0000 

75.6000 

8.0000 

18.0000 

4.0000 

75.5333 

4.0000 

17.0000 

19.0000 

75.5333 

5.0000 

11.0000 

17.0000 

75.5333 

24.0000 

26.0000 

14.0000 

75.4667 

5.0000 

23.0000 

28.0000 

75.4667 

4.0000 

15.0000 

26.0000 

75.3333 

17.0000 

24.0000 

19.0000 

75.3333 

5.0000 

23.0000 

25.0000 

75.2000 

5.0000 

7.0000 

17.0000 

75.2000 

4.0000 

15.0000 

23.0000 

75.0000 

5.0000 

23.0000 

17.0000 

74.9333 

5.0000 

23.0000 

3.0000 

74.8667 

4.0000 

26.0000 

15.0000 

74.8000 

23.0000 

24.0000 

19.0000 

74.8000 

5.0000 

23.0000 

14.0000 

74.8000 

5.0000 

23.0000 

1.0000 

74.8000 

24.0000 

26.0000 

25.0000 

74.7333 

24.0000 

30.0000 

19.0000 

74.7333 

5.0000 

23.0000 

19.0000 

74.7333 

5.0000 

23.0000 

9.0000 

74.6667 

5.0000 

7.0000 

22.0000 

74.6667 

4.0000 

26.0000 

19.0000 

Table  [4.4]  Results  of  combinations  of  3  features 


Percentage  of  correct  classification  for  30  best  combinations  in  set  1 


Percent  correct 

Feature  1 

Feature  2 

Feature  3  1  Feature  4  1 

81.0000 

5.0000 

21.0000 

23.0000 

9.0000 

80.6000 

5.0000 

7.0000 

23.0000 

6.0000 

80.2000 

5.0000 

21.0000 

23.0000 

11.0000 

79.6000 

5.0000 

21.0000 

23.0000 

10.0000 

79.4000 

5.0000  1 

7.0000 

23.0000 

12.0000 

79.4000 

5.0000 

10.0000 

23.0000 

21.0000  ] 

79.0000 

5.0000 

7.0000 

23.0000 

28.0000 

79.0000 

5.0000 

7.0000 

23.0000  1 

19.0000 

79.0000 

5.0000 

21.0000 

23.0000 

26.0000 

78.8000 

5.0000 

11.0000 

23.0000  1 

7.0000 

78.6000 

5.0000 

21.0000 

23.0000 

12.0000 

78.4000 

5.0000 

21.0000 

23.0000 

15.0000 

78.4000 

5.0000 

10.0000 

23.0000 

8.0000 

78.0000 

5.0000 

11.0000 

23.0000 

21.0000 

78.0000 

5.0000 

7.0000 

23.0000 

20.0000 

78.0000 

5.0000 

7.0000 

23.0000 

14.0000 

77.8000 

5.0000 

7.0000 

23.0000 

2.0000 

77.8000 

5.0000 

21.0000 

23.0000 

28.0000 

77.8000 

5.0000 

21.0000 

23.0000 

6.0000 

77.8000 

5.0000 

21.0000 

23.0000 

3.0000 

77.8000 

5.0000 

23.0000 

29.0000 

26.0000 

77.8000 

5.0000 

23.0000 

29.0000 

22.0000 

77.6000 

10.0000 

21.0000 

26.0000 

2.0000 

77.6000 

5.0000 

7.0000 

23.0000 

22.0000 

77.6000 

5.0000 

10.0000 

23.0000 

19.0000 

77.6000 

5.0000 

23.0000 

29.0000 

19.0000 

77.6000 

5.0000 

23.0000 

29.0000 

1.0000 

77.4000 

10.0000 

21.0000 

26.0000 

9.0000 

77.4000 

5.0000 

11.0000 

23.0000 

10.0000 

77.4000 

5.0000 

11.0000 

23.0000 

8.0000 

Table  [6.1]  Results  of  combinations  of  4  features 


Percentage  of  correct  classification  for  30  best  combinations  in  set  2 


Percent  correct 

Feature  1 

Feature  2 

Feature  3 

Feature  4 

81.0000 

5.0000 

23.0000 

29.0000 

14.0000 

79.8000 

5.0000 

10.0000 

23.0000 

21.0000 

79.6000 

5.0000 

21.0000 

23.0000 

11.0000 

79.4000 

14.0000 

24.0000 

26.0000 

19.0000 

79.4000 

5.0000 

21.0000 

23.0000 

9.0000 

i  79.2000 

5.0000 

21.0000 

23.0000 

13.0000 

79.0000 

5.0000 

11.0000 

23.0000 

3.0000 

79.0000 

5.0000 

23.0000 

29.0000 

21.0000 

78.8000 

5.0000 

23.0000 

29.0000 

6.0000 

78.6000 

4.0000 

19.0000 

17.0000 

25.0000 

j  78.6000 

5.0000 

21.0000 

23.0000 

10.0000 

78.4000 

4.0000 

19.0000 

17.0000 

6.0000 

78.4000 

5.0000 

23.0000 

29.0000 

19.0000 

78.2000 

5.0000 

11.0000 

23.0000 

25.0000 

78.2000 

5.0000 

11.0000 

23.0000 

6.0000 

78.2000 

4.0000 

15.0000 

28.0000 

27.0000 

78.2000 

5.0000 

7.0000 

23.0000 

11.0000 

78.2000 

19.0000 

24.0000 

30.0000 

11.0000 

78.0000 

5.0000 

21.0000 

23.0000 

27.0000 

77.8000 

19.0000 

24.0000 

30.0000 

23.0000 

77.8000 

19.0000 

24.0000 

30.0000 

16.0000 

77.8000 

5.0000 

10.0000 

23.0000 

11.0000 

77.6000 

4.0000 

19.0000 

17.0000 

3.0000 

77.6000 

5.0000 

7.0000 

23.0000 

28.0000 

77.4000 

14.0000 

24.0000 

26.0000 

20.0000 

77.4000 

5.0000 

21.0000 

23.0000 

30.0000 

77.2000 

5.0000 

11.0000 

23.0000 

8.0000 

77.2000 

4.0000 

19.0000 

17.0000 

11.0000 

77.2000 

5.0000 

7.0000 

23.0000 

26.0000 

77.2000 

5.0000 

21.0000 

23.0000 

12.0000 

Table  [6.2]  Results  of  combinations  of  4  features 


Percentage  of  correct  classification  for  30  best  combinations  m  set  3 


Pprrpnt  correct 

Feature  1 

Feature  2 

Feature  3 

Feature  4 

87  4000 

9.0000 

19.0000 

24.0000 

14.0000 

R7  2000 

9.0000 

14.0000 

24.0000 

19.0000 

87  0000 

9.0000 

19.0000 

24.0000 

11.0000 

86  8000 

9.0000 

19.0000 

24.0000 

18.0000 

86  6000 

5.0000 

21.0000 

23.0000 

29.0000 

86  6000  1 

9.0000  1 

19.0000 

24.0000 

16.0000 

86  4000 

9.0000 

19.0000  ‘ 

24.0000 

21.0000 

86  4000 

4.0000 

17.0000 

26.0000 

18.0000 

86  2000 

4.0000 

11.0000 

26.0000 

24.0000 

86  7000 

4.0000 

8.0000 

18.0000 

9.0000 

86  2000  j 

9.0000 

19.0000 

24.0000 

22.0000 

86  2000 

9.0000  1 

19.0000 

24.0000 

6.0000 

86  0000  | 

9.0000 

19.0000 

24.0000 

12.0000 

86  0000  ~ 1 

9.0000 

19.0000 

24.0000 

10.0000 

RS  8000 

9.0000 

19.0000 

24.0000 

26.0000 

OJ.OvuV 

85  8000 

4.0000 

17.0000 

26.0000 

9.0000 

#UvW 

85  6000 

5.0000 

7.0000 

21.0000 

16.0000 

OJ.Uvvv  _ 

85  6000 

5.0000 

7.0000 

21.0000 

8.0000 

85  6000 

9.0000 

19.0000 

24.0000 

8.0000 

85  6000 

9.0000 

19.0000 

24.0000 

5.0000 

85  6000 

9.0000 

19.0000 

24.0000 

1.0000 

0*/.vvvv 

85  4000 

9.0000 

14.0000 

24.0000 

4.0000 

85.4000 

5.0000 

21.0000 

23.0000 

1.0000 

85  2000 

4.0000 

19.0000 

17.0000 

10.0000 

85.2000 

9.0000 

19.0000 

24.0000 

4.0000  ’ 

85  0000 

5.0000 

11.0000 

17.0000 

4.0000 

85  0000 

9.0000 

19.0000 

24.0000 

2.0000 

85  0000 

4.0000 

17.0000 

26.0000 

8.0000 

84.8000 

4.0000 

11.0000 

26.0000 

9.0000 

84.8000 

5.0000 

21.0000 

23.0000 

22.0000 

Table  [6.3]  Results  of  combinations  of  4  features 


3-A-16 


Percentage  of  correct  classification  for  30  best  combinations  on  average 


Percent  correct 

Feature  1 

Feature  2 

Feature  3 

Feature  4 

81.0667  1 

5.0000 

21.0000 

23.0000 

9.0000 

79.9333 

5.0000 

23.0000 

29.0000 

21.0000 

79.8667 

5.0000 

21.0000 

23.0000 

11.0000 

79.6000  1 

5.0000 

10.0000 

23.0000 

21.0000 

79.2667 

5.0000 

23.0000 

29.0000 

19.0000 

79.1333 

5.0000 

21.0000 

23.0000 

10.0000 

79.0667 

5.0000 

23.0000 

29.0000 

14.0000 

79.0000 

14.0000 

24.0000 

26.0000 

19.0000 

78.9333 

5.0000 

7.0000 

23.0000 

12.0000 

78.8667 

5.0000 

21.0000 

23.0000 

22.0000 

78.8667 

5.0000 

7.0000 

23.0000 

28.0000 

78.7333 

5.0000 

7.0000 

23.0000 

6.0000 

78.6667 

5.0000 

21.0000 

23.0000 

7.0000 

78.5333 

5.0000 

21.0000 

23.0000 

1.0000 

78.4667 

5.0000 

23.0000 

29.0000 

1.0000 

78.4000 

5.0000 

7.0000 

21.0000 

8.0000 

78.4000 

5.0000 

7.0000 

23.0000 

26.0000 

78.2667 

5.0000 

7.0000 

23.0000 

11.0000 

78.2000 

5.0000 

7.0000 

23.0000 

22.0000 

78.2000 

5.0000 

23.0000 

29.0000 

28.0000 

78.1333 

5.0000 

11.0000 

23.0000 

10.0000 

78.1333 

5.0000 

10.0000 

23.0000 

25.0000 

78.0667 

5.0000 

7.0000 

23.0000 

16.0000 

78.0000 

5.0000 

7.0000 

23.0000 

20.0000 

77.8667 

5.0000 

10.0000 

23.0000 

29.0000 

Table  [6.4]  Results  of  combinations  of  4  features 


3-A-17 


k 

Correct 

classification 

Performance 

Index 

1 

73 

0.5196 

2 

74 

0.5099 

3  1 

77 

0.4796 

4 

77 

0.4796 

5 

82 

0.42 

6 

81 

0.4359 

7 

76 

0.4899 

8 

80 

0.4472  1 

9 

79 

0.4583 

10 

79 

0.4583 

Table[7.1]  Classification  results  with  changing  K  for  the  crisp  classifier  for  set  1 


k 

Correct 

classification 

Performance 

Index 

1 

74 

0.5099 

2 

74 

0.5099 

3 

77 

0.4796 

4 

77 

0.4796 

5 

74 

0.5099 

6 

76 

0.4899 

7 

76 

0.4899 

8 

75 

0.5000 

9 

78 

0.4690 

10 

78 

0.4690 

Table[7.2]  Classification  results  with  changing  K  for  the  crisp  classifier  for  set  2 


3- A- 18 


k 

Correct 

classification 

Performance  Index 

1 

79 

0.4583 

2 

79 

0.4583 

3 

81 

0.4359 

4 

84 

0.4000 

5 

83 

0.4123 

6 

85 

0.3873 

7 

81 

0.4359 

8 

81 

0.4359 

9 

82 

0.4243 

10 

82 

0.4243 

Table[7.3]  Classification  results  with  changing  K  for  the  crisp  classifier  for  set  3 


k 

Correct 

classification 

Performance 

Index 

1 

75.3333  i 

0.4959 

2 

75.6667 

0.4927 

3 

78.3333 

0.4650 

4 

79.3333 

0.4531 

5 

79.6667 

0.4474 

6 

80.6667 

0.4377 

7 

77.6667 

0.4719 

8 

78.6667 

0.4610 

9 

79.6667 

0.4505 

10 

79.6667 

0.4505 

Table[7.4]  Average  classification  results  with  changing  K  for  the  crisp  classifier 


3 -A-  19 


percent  classification 

performanc 
e  index 

k  \  Threshold 

0.3 

0.4 

0.5 

0.6 

0.7 

0.8 

1 

73 

73 

73 

73 

73 

73 

0.5196 

2 

77 

75 

73 

74 

72 

73 

0.4267 

3 

75 

74 

77 

75 

73 

69  1 

0.4261 

4 

75 

74 

76 

77 

76 

69 

0.4157 

5 

74 

74 

81 

79 

76 

73 

0.4061 

6 

69 

74 

78 

79 

76 

74 

0.3993 

7 

70 

74 

77 

81 

77 

72 

0.3980 

8 

70 

75 

79 

79 

79 

72 

0.3977 

9 

69 

72 

78 

80 

79 

71 

0.3971 

10 

68 

73 

78 

79 

79 

70  . 

0.3978 

Table[8.1]  Classification  results  for  the  fuzzy  classifier  for  set  1 


percent  classification 

performance 

index 

k  \  Threshold 

0.3 

0.4 

0.5 

0.6 

0.7 

0.8 

1 

74 

74 

74 

74 

74 

74 

0.5099 

2 

72 

75 

74 

77 

78 

77 

0.4328 

3 

73 

75 

79 

79 

77 

73 

0.4316 

4 

73 

75 

79 

76 

76 

72 

0.4262 

5 

71 

76 

76 

78 

77 

74 

0.4176 

6 

72 

73 

76 

79 

75 

72 

0.4164 

7 

71 

73 

79 

79 

77 

70 

0.4092 

8 

69 

74 

78 

80 

77 

70 

0.4099 

9 

73 

75 

80 

79 

77 

70 

0.4059 

10 

72 

73 

81 

79 

76 

72 

0.4004 

Table[8.2]  Classification  results  for  the  fuzzy  classifier  for  set  2 


percent  classification 

performance 

index 

k  \  Threshold 

0.3 

0.4 

0.5 

0.6 

0.7 

0.8 

1 

79 

79 

79 

79 

79 

79 

0.4583 

2 

73 

76 

EiH 

84 

84 

84 

0.3991 

3 

72 

75 

e» 

85 

85 

82 

0.3862 

4 

75 

78 

84 

86 

86 

83 

0.3704 

5 

74 

80 

83 

86 

86 

84 

0.3635 

6 

75 

82 

85 

87 

85 

OH 

0.3588 

7  • 

74 

80 

82 

84 

84 

82 

0.3605 

8 

73 

78 

83 

84 

84 

81 

0.3638 

9 

73 

79 

83 

84 

85 

Em 

0.3625 

10 

73 

80 

83 

84 

85 

EEH 

0.3615 

Table[8.3]  Classification  results  for  the  fuzzy  classifier  for  set  3 


percent  classification 

performanc 
e  index 

k  \  Threshold 

0.3 

0.4 

0.5 

0.6 

0.7 

0.8 

1 

75.33 

75.33 

75.33 

75.33 

75.33 

0.4959 

2 

74 

75.33 

75.33 

Brylin 

78 

0.4195 

3 

73.33 

74.67 

79 

79.67 

78.33 

74.67 

0.4146 

4 

74.33 

75.67 

79.67 

79.67 

79.33 

74.67 

0.4041 

5 

73 

76.67 

80 

81 

79.67 

77 

0.3957 

6 

72 

76.33 

79.67 

81.67 

76.33 

0.3915 

7 

71.67 

75.67 

81.33 

KIM 

0.3892 

8 

70.67 

75.67 

81 

80 

KIM 

0.3905 

9 

71.67 

75.33 

80.33 

81 

80.33 

74 

0.3885 

10 

71 

75.33 

80.67 

80.67 

80 

74.67 

0.3866 

Table[8.3]  Average  classification  results  with  for  the  fuzzy  classifier 


3-A-21 


File 

Membership 

Defuzzified 

Result 

1.0000 

0.2736 

0 

2.0000 

0.3339 

0 

3.0000 

0.5397 

0 

0 

1  l 

4.0000 

0.5450 

0 

5.0000 

0.7423 

1.0000 

6.0000 

0.1732 

0 

0 

l 

7.0000 

0.8901 

1.0000 

8.0000 

1.0000 

1.0000 

1  Misclassified 

1  l 

9.0000 

0.5376 

0 

10.0000 

0.1742 

0 

11.0000 

0.4366 

0 

0 

1 

12.0000 

0.3458 

0 

13.0000 

0.5145 

0 

14.0000 

0.5178 

0 

0 

1 

15.0000 

0.1016 

0 

16.0000 

0 

0 

17.0000 

0 

0 

0 

1  i 

i  18.0000 

0.1334 

0 

0 

1 

19.0000 

0 

0 

20.0000 

0 

0 

21.0000 

0.2923 

0 

0 

22.0000 

0 

0 

23.0000 

0 

0 

24.0000 

0.1607 

0 

0 

1 

25.0000 

0 

0 

26.0000 

0.4421 

0 

27.0000 

1.0000 

1.0000 

0 

1 

28.0000 

0.3307 

0 

29.0000 

0.0583 

0 

30.0000 

0.4965 

6 

0 

1 

31.0000 

0.3505 

0 

32.0000 

0.1181 

0 

33.0000 

0.2101 

0 

0 

Table  [9.1]  Classification  of  the  files  of  set  1 


File  Membership  Defuzzified 

Result 

34.0000  0.5970  0 

35.0000  0  0 

36.0000  0.1193  0 

0 

37.0000  0.3174  0 

38.0000  0.8117  1.0000 

39.0000  0.0997  0 

0 

40.0000  0.1889  0 

41.0000  0.4215  0 

42.0000  0.1635  0 

0 

43.0000  0.6474  1.0000 

44.0000  0  0 

45.0000  0.5495  0 

0 

46.0000  0.1115  0 

0 

47.0000  0  0 

48.0000  0.3986  0 

49.0000  0  0 

50.0000  0  0 

0 

51.0000  0.6709  1.0000 

52.0000  1.0000  1.0000 

53.0000  0.5297  0 

1 

54.0000  0.7245  1.0000 

55.0000  0.9200  1.0000 

56.0000  1.0000  1.0000 

1 

57.0000  0.9105  1.0000 

58.0000  0.9398  1.0000 

59.0000  0.5657  0 

1 

60.0000  0.8968  1.0000 

61.0000  1.0000  1.0000 

62.0000  0.2793  0 

63.0000  0.1088  0 

0  Misclassified 

64.0000  0.6245  1.0000 

65.0000  0.8643  1.0000 

66.0000  0.5054  0 

1 

Table  [9.1]  Continued 


File 

Membership 

Defuzzified 

Result 

67.0000 

0.8498 

1.0000 

68.0000 

0.6969 

1.0000 

69.0000 

0.8397 

1.0000 

1 

I 

70.0000 

0.2901 

0 

71.0000 

0.8291 

1.0000 

72.0000 

0.3982 

0 

0  Misclassifled 

l 

73.0000 

1.0000 

1.0000 

74.0000 

0.2463 

0 

75.0000 

0.8043 

1.0000 

1 

l 

76.0000 

0.6676 

1.0000 

77.0000 

1.0000 

1.0000 

78.0000 

1.0000 

1.0000 

1 

1 

|  79.0000 

1.0000 

1.0000 

80.0000 

0.7538 

1.0000 

81.0000 

1.0000 

1.0000 

1 

-  1 

82.0000 

1.0000 

1.0000 

83.0000 

0.8378 

1.0000 

!  84.0000 

1.0000 

1.0000 

1 

85.0000 

0.8926 

1.0000 

86.0000 

0.5448 

0 

87.0000 

0.5751 

0 

0  Misclassified 

88.0000 

0.8273 

1.0000 

89.0000 

0.2945 

0 

90.0000 

0.9110 

1.0000 

1 

91.0000 

1.0000 

1.0000 

92.0000 

1.0000 

1.0000 

93.0000 

0 

0 

1 

94.0000 

0.2887 

0 

95.0000 

0.2079 

0 

96.0000 

0.5793 

0 

0  Misclassified 

97.0000 

1.0000 

1.0000 

98.0000 

0.7971 

1.0000 

99.0000 

0.8708 

1.0000 

1 

1  100.0000 

1.0000 

1.0000 

1 

Table  [9.1]  Continued 


File 

Membership 

Defuzzified 

Result 

1.0000 

0.2579 

0 

2.0000 

0.1307 

0 

3.0000 

0 

0 

0 

4.0000 

0.2652 

0 

5.0000 

0.4345 

0 

6.0000 

0.1175 

0 

0 

7.0000 

1.0000 

1.0000 

8.0000 

0.7086 

1.0000 

1  Misclassified 

9.0000 

0.2856 

0 

10.0000 

0.2745 

0 

11.0000 

0.3056 

0 

0 

12.0000 

0.2720 

0 

13.0000 

0.5019 

0 

14.0000 

0.8871 

1.0000 

0 

15.0000 

0.0912 

0 

16.0000 

0 

0 

17.0000 

0 

0 

0 

18.0000 

0.8334 

1.0000 

1  Misclassified 

19.0000 

0 

0 

20.0000 

0 

0 

21.0000 

0.5483 

0 

0 

22.0000 

0 

0 

23.0000 

0 

0 

24.0000 

0.1535 

0 

0 

25.0000 

0.4955 

0 

26.0000 

0.1013 

0 

27.0000 

1.0000 

1.0000 

0 

28.0000 

0.3788 

0 

29.0000 

0.1638 

0 

30.0000 

0.0905 

0 

0 

31.0000 

0 

0 

32.0000 

0.1431 

0 

33.0000 

0.0937 

0 

0 

Table  [9.2]  Classification  of  the  files  of  set  2 


File  Membership  Defuzzified 

Result  I 

j  34.0000  0  0 

1 

35.0000  0  0 

1 

1"  36.0000  0.1281  0 

0 

1  37.0000  0.3690  0 

1 

I  38.0000  0.5734  0 

I 

39.0000  0.1569  0 

0  l 

40.0000  0.3659  0 

-I 

I  41.0000  0.4124  0  _  _ 

_ I 

42.0000  0.1704  0 

0  1 

I  43.0000  0.4251  0 

_ I 

|  44.0000  0.0664  0 

1 

45.0000  0.5356  0 

0 

I  46.0000  0.5084  0 

0 

|  47.0000  0.1735  0 

—1 

48.0000  0.7512  1.0000 

I 

|  49.0000  0.5115  0  1 

-1 

50.0000  0.0976  0 

0  1 

I  51.0000  0.6361  1.0000 

- 3 

1  52.0000  0.8482  1.0000 

1  1 

1 

1  53.0000  0.3471  0 

1  54.0000  0.8822  1.0000 

y~  _| 

|  55.0000  1.0000  1.0000 

i  | 

1  56.0000  1.0000  1.0000 

_ I 

|  57.0000  1.0000  1.0000 

1 

I  58.0000  0.8730  1.0000 

i 

I  59.0000  0  0 

j 

I  60.0000  0.0389  0 

"i  | 

f  61.0000  0.3643  0 

0  Misclassificd  1 

1  62.0000  1.0000  1.0000 

-  I 

63.0000  0.8174  1.0000 

— 1 

j  64.0000  0.8875  1.0000 

1 

!  65.0000  0.7995  1.0000 

1 

f  66.0000  0.5919  0 

1 

67.0000  0.7533  1.0000 

1 

Table  [9.2]  Continued 


|  File  Membership 

Defuzzifled 

Result 

68.0000 

0.7337 

1.0000 

69.0000 

0.8524 

1.0000 

70.0000 

0.8602 

1.0000 

1 

73.. 0000 

0.2217 

o  1 

72.0000 

1.0000 

1.0000 

73.0000 

0.1268 

0 

0  Misclassified 

74.0000 

0.8860 

1.0000 

75.0000 

0.2121 

0 

76.0000 

0.1684 

0 

77.0000 

0.6903 

1.0000 

0  Misclassified 

78.0000 

0.7680 

1.0000 

79.0000 

0.8735 

1.0000 

80.0000 

0.8013 

1.0000 

1 

81.0000 

0.1748 

0 

82.0000 

0.5428 

0 

83.0000 

0.8496 

1.0000 

0  Misclassified 

84.0000 

0.3444 

0 

85.0000 

0.8298 

1.0000 

86.0000 

0.8590 

1.0000  ! 

1 

87.0000 

0.6879 

1.0000 

88.0000 

0.9082 

1.0000 

89.0000 

0.6653 

1.0000 

1 

90.0000 

0.1636 

0 

91.0000 

0.8754 

1.0000 

92.0000 

0.8594 

1.0000 

1 

93.0000 

0.5185 

0 

94.0000 

0.4932 

0 

95.0000 

0.7802 

1.0000 

0  Misclassified 

96.0000 

0.8684 

1.0000 

97.0000 

0.8788 

1.0000 

98.0000 

1.0000 

1.0000 

1 

99.0000 

1.0000 

1.0000 

100.0000 

0.8669 

1.0000 

1 

Table  [9.2]  Continued 


1  File  Membership 

Defuzzified 

Result 

1.0000 

0.3986 

0 

2.0000 

0.2845 

0 

3.0000 

0.2562 

0 

0 

4.0000 

0.2786 

0 

5.0000 

0.3226 

0 

6.0000 

0 

0 

0 

7.0000 

1.0000 

1.0000 

8.0000 

0.5055 

0 

9.0000 

0.1434 

0 

0 

10.0000 

0 

0 

11.0000 

0 

0 

0 

12.0000 

0.0691 

0 

13.0000 

0.4744 

0 

14.0000 

0.4708 

0 

0 

15.0000 

0 

0 

16.0000 

0 

0 

17.0000 

0 

0 

0 

18.0000 

0.4623 

0 

0 

19.0000 

0 

0 

20.0000 

0 

0 

21.0000 

0.2096 

0 

0 

22.0000 

0 

0 

23.0000 

0 

0 

24.0000 

0.0516 

0 

0 

25.0000 

0.2885 

0 

26.0000 

0.0981 

0 

27.0000 

0.9336 

1.0000 

0 

28.0000 

0.2254 

0 

29.0000 

0.1465 

0 

30.0000 

0.0680 

0 

0 

31.0000 

0 

0 

32.0000 

0 

0 

33.0000 

0.0939 

0 

0 

Table  [9.3]  Classification  of  the  files  of  set  3 


File 

Membership 

Defuzzifled 

Result 

34.0000 

0.3917 

0 

35.0000 

0 

0 

36.0000 

0 

0 

0 

37.0000 

0.1689 

0 

38.0000 

0.5220 

0 

39.0000 

0 

0 

0 

40.0000 

0.0969 

0 

41.0000 

0 

0 

42.0000 

0 

0 

0 

43.0000 

0.4810 

0 

44.0000 

0.3154 

0 

45.0000 

0.4552 

0 

0 

• 

46.0000 

0.3285 

0 

0 

47.0000 

0.3690 

0 

48.0000 

0.5593 

0 

49.0000 

0.3522 

0 

50.0000 

0.2325 

0 

0 

51.0000 

1.0000 

1.0000 

52.0000 

0.9052 

1.0000 

53.0000 

0.8115 

1.0000 

1 

54.0000 

0.8397 

1.0000 

55.0000 

0.8754 

1.0000 

56.0000 

0.0930 

0 

1 

57.0000 

0.8330 

1.0000 

58.0000 

1.0000 

1.0000 

1 

59.0000 

1.0000 

1.0000 

60.0000 

1.0000 

1.0000 

61.0000 

1.0000 

1.0000 

1 

62.0000 

1.0000 

1.0000 

63.0000 

0.6496 

1.0000 

64.0000 

0.5075 

0 

1 

65.0000 

0.0823 

0 

66.0000 

0.7810 

1.0000 

67.0000 

0.2356 

0 

0  Misclassified 

Table  [9.3]  Continued 


|  File  Membership  Defuzzified 

Result 

68.0000  1.0000  1.0000 

}“  69.0000  1.0000  1.0000 

70.0000  1.0000  1.0000 

1 

1  71.0000  1.0000  1.0000 

1  72.0000  1.0000  1.0000 

j  73.0000  1.0000  1.0000 

1 

1  74.0000  1.0000  1.0000 

1  75.0000  1.0000  1.0000 

1  76.0000  1.0000  1.0000 

1 

1  77.0000  1.0000  1.0000 

P  78.0000  1.0000  1.0000 

79.0000  1.0000  1.0000 

1 

1  80.0000  0.6068  1.0000 

1  81.0000  0.9054  1.0000 

I  82.0000  0.4134  0 

1 

I  83.0000  1.0000  1.0000  J 

|  84.0000  0  0 

t  85.0000  0.2914  0 

0  Misclassified 

j  86.0000  1.0000  1.0000 

1  87.0000  1.0000  1.0000 

88.0000  0.8786  1.0000 

1 

j  89.0000  0.9018  1.0000 

|  90.0000  1.0000  1.0000 

I  91.0000  1.0000  1.0000 

1 

J 

1  92.0000  1.0000  1.0000 

|  93.0000  0.9135  1.0000 

I-  94.0000  0.8292  1.0000 

1 

1  95.0000  0.7423  1.0000 

1  ~  96.0000  1.0000  1.0000 

I  97.0000  0.0902  0 

1 

j  98.0000  0.2564  0 

I  99.0000  0  0 

j  100.0000  0.4387  0 

0  Misclassified 

Table  [9.3]  Continued 


Non  deceptive 

Deceptive  1 

Deceptive  2 

Deceptive  3 

QQ8R9OIO.011 

QQ8R9OIO.021 

QQ8R9OIO.03 1 

QQ95LU1T.011 

QQ95LU1T.021 

QQ95LU1T.031 

QQAURNUS.021 

QQAURNUS.031 

QQAV53P6.011 

QQAV53P6.021 

QQAV53P6.03 1 

QQBQ4SHI.011 

QQBQ4SHI.021 

QQBQ4SH3.03 1 

QQBSS7WT.011 

QQBSS7WT.021 

QQBSS7WT.03 1 

QQ70XM60.021 

QQ7RH0RO.011 

QQ7RH0RO.02 1 

QQ7RH0RO.031 

QQ7R51P9.011 

QQ7R51P9.021 

QQ7R51P9.031 

QQ9TDSP3.011 

QQ9TDSP3.021 

QQ9TDSP3.031 

QQA8OWOI.0 1 1 

QQA8OWOI.021 

QQA8OWOI.03 1 

QQBT22O6.011 

QQBT22O6.021 

QQBT22O6.03 1 

QQBO9O_9.011 

QQBO9O9.021 

QQBO9O9.031 

QQBC7PP6.011 

QQBC7PP6.021 

QQBC7PP6.03 1 

QQCHCKO.Oll 

QQCHCKO.021 

QQCHCKO.03 1 

QQCDTKP0.011 

QQCDTKP0.031 

QQCDTKP0.041 

QQCM5Y56.011 

QQCQQT8Y.011 

QQCQQT8Y.021 

QQCQQT8Y.031 

QQCQQT8Y.041 


QQ4Q 1083 .Oil 

QQ4Q1O83.021 

QQ4Q1O83.031 

QQ4Q3MDC.011 

QQ4Q3MDC.021 

QQ4Q3MDC.03 1 

QQ51DE36.011 

QQ51DE36.021 

QQ51DE36.041 

QQ6RQGH6.011 

QQ6RQGH6.021 

QQ6RQGH6.03 1 

QQ6RQGH6.041 

QQ6T7 110.011 

QQ6T7 110.021 

QQ6T7 110.031 

QQ6Z59IG.011 

QQ6Z59IG.021 

QQ6Z59IG.03 1 

QQ7PP9B9.011 

QQ7PP9B9.021 

QQ7PP9B9.031 

QQ7PDU1X.011 

QQ7PDU1X.021 

QQ7PDU1X.031 

QQ7JPEPF.011 

QQ7_PIPF.021 

QQ7_PIPF.03 1 

QQ7JT70.011 

QQ7JT70.021 

QQ7_JT70.03 1 

QQ738DYX.011 

QQ738DYX.021 

QQ738DYX.031 

QQ75ULP9.011 

QQ75ULP9.021 

QQ75ULP9.03 1 

QQ79EYF.011 

QQ79EYF.021 

QQ79EYF.03 1 

QQ7BGDML.011 

QQ7BGDML.021 

QQ7BGDML.031 

QQ7ETC8I.011 

QQ7ETC8I.021 

QQ7ETC8I.03 1 

QQ7JAQCS.011 

QQ7JAQCS.021 

QQ7JAQCS.031 

QQ7LX5Q0.011 


QQ7LX5Q0.021 

QQ7LX5Q0.031 

QQ7MN2Y0.011 

QQ7MN2Y0.021 

QQ7MN2Y0.03 1 

QQ7TC5UF.011 

QQ7TC5UF.021 

QQ7TC5UF.031 

QQ7TQVER.011 

QQ7TQVER.021 

QQ7TQVER.031 

QQ7TVADC.011 

QQ7TVADC.021 

QQ7TVADC.031 

QQ7U2T4R.011 

QQ7U2T4R.021 

QQ7U2T4R.03 1 

QQ7YP7QU.011 

QQ7YP7QU.021 

QQ7YP7QU.03 1 

QQ7YZOJ3.011 

QQ7YZOJ3.021 

QQ7YZOJ3.03 1 

QQ80DPT.011 

QQ8_0DPT.021 

QQ80DPT.03 1 

QQ80DPT.041 

QQ8_2UQ9.011 

QQ82UQ9.021 

QQ82UQ9.03 1 

QQ800IG6.011 

QQ800IG6.021 

QQ800IG6.031 

QQ82OIU9.011 

QQ82OIU9.021 

QQ82OIU9.03 1 

QQ82SUTX.011 

QQ82SUTX.021 

QQ82SUTX03 1 

QQ860ZNU.011 

QQ860ZNU.021 

QQ860ZNU.031 

QQ89U_ZR.011 

QQ89U_ZR.021 

QQ89U_ZR.03 1 

QQ8ATU26.011 

QQ8ATU26.021 

QQ8ATU26.03 1 

QQ8FGMVI.011 

QQ8FGMVI.021 


QQ8RAJ0C.011 

QQ8RAJ0C.021 

QQ8RAJ0C.03 1 

QQ9EUKVT.011 

QQ9EUKVT.021 

QQ9EUKVT.03 1 

QQ9IOOXO.021 

QQ9IOOXO.041 

QQ9SOW8L.011 

QQ9SOW8L.021 

QQ9SOW8L.03 1 

QQ9SQIK9.011 

QQ9SQIK9.021 

QQ9SQDC9.03 1 

QQ9W0B9F.011 

QQ9  W0B9F.03 1 

QQ9W0B9F.O41 

QQ9U4FMU.011 

QQ9U4FMU.021 

QQ9U4FMU.03 1 

QQ9Y_SVF.011 

QQ9Y_SVF,02 1 

QQ9YSVF.03 1 

QQ9YH3QF.011 

QQ9YH3QF.021 

QQ9YH3QF.03 1 

QQA2TT4C.01 1 

QQA2TT4C.021 

QQA2TT4C.03 1 

QQA3HIRX.011 

QQA3HIRX.021 

QQA3HIRX.03 1 

QQA32UTF.011 

QQA32UTF.02 1 

QQA32UTF.03 1 

QQA6UIF.011 

QQA6U_IF.03 1 

QQA6UJF.041 

QQAM4E3L.011 

QQAM4E3L.02 1 

QQAM4E3L.03 1 

QQARF2_X.011 

QQARF2_X.021 

QQARF2_X.03 1 

QQAWA38X.011 

QQAWA38X.021 

QQAWA38X.031 

QQAYXZGU.0 1 1 

QQAYXZGU.021 

QQAYXZGU.031 


Table  [10]  NSA  Polygraph  files  used  in  sets  1-3. 

Note:  Each  set  consists  of  non-deceptive  files  and  one  of  the  deceptive  sets 
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Appendix  B: 
Program  Listings 


Classify  Program 


%  This  is  a  Matlab  program 
%  This  script  parses  a  matrix  of  polygraph 
%  vectors  into  training  and  testing  vectors. 
%  It  then  calls  the  classifier,  trains,  tests 
%  and  gives  results. 


c  =  2  ;  %  number  of  classes 

percent_train=.75;  %  percentage  of  inputs  used  for  training 


features=[l] 

classification^; 

kk=5; 

change=l; 

repeat=20; 

ut=.5; 

lt=.5; 


%  features  to  use 
%  use  frizzy  classifier 

%  K  in  K  nearest  neighbor 
%  Randomize  training  and  testing  inputs 
%  Number  of  repeatitions 
%  Upper  threshhold  for  3  class  fuzzy  classifier 
%  Lower  threshhold  for  3  class  fuzzy  classifier 


load  set3 1 ;  %  file  containing  feature  matrix 

%  and  vector  that  indicates  whether 
%  column  is  truthful  or  deceptive 

%classvect;  %  vector  of  classes  eg.  1  =  deceptive 

%  0  =  truthful  vector 

featurematrix  =  featmat;  %  matrix  of  features 

dimension  =  size(featurematrix); 

columns  =  dimension(2);  %  the  total  number  of  columns  in  the  feature  matrix 

number_train  =  round(percent_train*columns);  %  number  of  vectors 

%  used  for  training 


ur=.5;  %upper  threshold 

continued;  %  to  repeat  the  program 

while  (continue— 1) 

l _ _ _ _ — 

apercent_classified=[];  %  clear  average  results 

acorrect=[]; 

acc=[]; 

ffresult=[]; 

ccresult=[]; 

ttestclass=[]; 

men=0; 

while(men  ~=7) 

men=menu('Select:', 'Features', 'TypeVK','Random'. . . 
, 'Repeat', '%  training', 'Start’.'Defuzz', 'Exit'); 


if  (men==l) 

'enter  a  vector  of  the  features  you  want  tested  (eg.  [12  4]) 


features  =  input('  ’);  %  features  being  tested 

end 


if  (men— 2) 

classification=menu(,Type:','Fuz2y',,Crisp’); 

end 


if  (men==3) 

kk  =  input('enter  the  "K"  in  K  nearest  neighbor  ’) 
end 


if  (men==4) 

change=menu('Selection','Random','Constant'); 

end 

if  (men==5) 

repeat=input('Enter  number  of  repeatitions') 
end 

if  (men==6) 

percent_train=input('Enter  percentage  of  the  files  used  for  training,  1  for  all-1') 
end 

if  (men==8) 

ch=menu('Defuzzification',  '3class',  'Upper  thresh', 'Lower  thresh'); 
ifch— 1,  classification=3,  end 

if  ch==2 

ut=input('enter  the  upper  threshhold');  %  lower  limit  for  class  1 
end 

if  ch— 3 

lt=input('enter  the  lower  threshhold');  %upper  limit  for  class  0 
end 

end 

if  (men==9)  break, end 
end 

if  men==9  break,end 

number_train  =  round(percent_train*columns); 

acorrect=[];  %  vector  for  the  average  of  correct  classification 

acc=[];  %  vector  for  the  average  of  performance  index 

if  percent_train  =  1  %  To  repeat  nonrandom  testing  for  all  the  files. 

repeat  =columns; 
end 

for  trial=l:  repeat 

featurematrix  =  featmat(features,:);  %  creates  a  feature  matrix  of  the 

%  the  features  being  tested 
if  ( (change=l)  &  (percent_train~=l) ) 

[trainvect,  testvect]  =  randvect(number_train, columns); 

end; 

if  percenttrain  ==  1 

testvect  =  trial; 
if  (trial  —1) 

trainvect=2:columns; 
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end 

if  (trial  =  columns) 

trainvect=l  :columns-l; 
end 

if  ( trial  ~=1  &  trial -^columns  ) 

trainvect=  [l:trial-l ,  trial+l:columns]; 
end 
end 

testvect 

trainvect 

u  =  featurematrixC, testvect);  %  testing  matrix 

testclass  =  classvect(  1  .testvect);  %  class  of  each  column  in  testing  matrix 

p  =  featurematrixC, trainvect);  %  training  matrix 

t  =  classvect(  1  .trainvect);  %  class  of  each  column  in  training  matrix 


if  classification  =1  %  Fuzzy  classifier 

%  m  =  input('enter  the  degree  of  fuzziness  "M"  (l<=M<=infinfity)') 

m  =  2; 

save  fdatafil  c  kk  m  p  t  u 

%  !fknn  %This  line  invokes  the  classifier  program  in  a  dos  window 

dos('del  foutfile.mat|')  %to  make  sure  that  the  program  actulally  works 

dos('fknn|') 

'Now  loading  the  result  of  the  fuzzy  classifier' 
load  foutfile 

I _ _ _ _ _ _ _ _  ___t 

kk,  features 

fresult 

testclass 

if(percent_train==l) 

ffresult=[flresult  fresult] 
ttestclass=[ttestclass  testclass]; 
end 

cr  =fresult(2,:)  >  ut  %  defuzzification  of  the  result 

correct  =  100*(l-mean(abs(testclass-cr)))  %  percentage  correct  classified 

cc  =  [1-testclass;  testclass];  %  adding  a  row  of  complements  to  c 

cc=fresult-cc; 

'Performance  Index- 
cc  =  sqrt(mean(mean(cc  .A  2))) 
end 

if  classification  =  2  %  crisp  classifier 

save  cdatafil  c  kk  p  t  u 

%  Icknn  %This  line  invokes  the  classifier  program  in  a  dos  window 

dos('del  foutfile.  matf)  %to  make  sure  that  the  program  actulally  works 

dos('cknn|') 

'Loading  the  Crisp  output  file' 
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load  coutfile 

i _ « 

kk,  features 

cresult 

testclass 

if(percent_train=  1) 

ccresult=[ccresult  cresult] 
ttestclass=[ttestclass  testclass]; 
end 

correct  =  100*(l-mean(abs(testclass-cresult)))  %  percentage  correct  classified 
cc  =  sqrt(mean(abs(testclass-cresult)))  %  performance  index 

end 

if  classification  =  3  %  Fuzzy  classifier  but  defuzzification  into  3  classes 

%  m  =  inputf'enter  the  degree  of  fuzziness  "M"  (l<=M<=infinfity)') 
m  =  2; 

save  fdatafil  c  kk  m  p  t  u 

%  ifknn  %This  line  invokes  the  classifier  program  in  a  dos  window 

dos('del  foutfile.matD  %to  make  sure  that  the  program  actulally  works 

dos('fknn|') 

'Now  loading  the  result  of  the  fuzzy  classifier' 
load  foutfile 

I _ l 

kk,  features 

fresult 

testclass 

if(percent_train=l) 

ffresult=[ffresult  fresult] 
ttestclass=[ttestclass  testclass]; 
end 

class  1  =find(fr  esult(2 , : )  >ut); 
class0=find(fresult(2,:)  <lt); 
class3=find(fresult(2,:)  >lt  &  fresult(2,:)  <ut); 

percent_classified=100*((length(class0)+length(classl))/length(testclass)) 
fr=[fresult(:,classl)  fresult(:,classO)]  %  the  section  that  is  classified  into  one  of  the  two 

classes 

cr=fr(2,:)>ut 

tr=[testclass(classl)  testclass(classO)]  %  the  section  that  is  classified  into  one  of  the  two 

classes 

correct  =  100*(l-mean(abs(tr-cr)))  %  percentage  correct  classified 
cc  =  [1-tr;  tr];  %  adding  a  row  of  complements  to  cc 
cc^fr-cc; 

'Performance  Index==' 
cc  =  sqrt(mean(mean(cc  .A  2))) 
end 

apercent_classified  =  [apercent_classified  percent_classified] 
acorrect=[acorrect  correct] 
acc=[acc  cc] 
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end 


%  for  trial 


if  classification  —3  %  3  class  fuzzy 

apercent_classified=mean(apercent_classified) 

end 

acorrect,  mean(acorrect) 
acc,  mean(acc) 

continue=3; 

while  (continue  ==  3  |  continue==4) 
continue=menu('Repeat?',  'Yes',  W,'Plot',  'threshold'); 
if(continue==3) 

dim=menu('Dimension',  'Two',  'Three')+1; 
if(dim=2) 

pp=p(:,fmd(t»; 
plot(pp(l,:),pp(2,:),’r+'); 
title('A  clustering  of  two  class  data'); 
hold  on 

pp=p(:  ,find(t=0)); 
plot(pp(l,:),  pp(2,:),  ’gx’); 

pp=u(:,  fmd(testclass)); 
plot(pp(l,:),  pp(2,:),  ’r+’); 
pp=u(:,find(testclass==0)); 
plot(pp(l,:),  pp(2,:),  'gx'); 

hold  off 

end  %if(dim==2) 


if(dim— 3) 


pp=p(:,fmd(t)); 

plot3(pp(l,:),pp(2,:),  pp(3,:),  'r+'); 
title(’A  clustering  of  two  class  data'); 
hold  on 

pp=p(:,find(t=0)); 
plot3(pp(l,:),  pp(2,:),  pp(3,:),  'rx'); 

pp=u(:,  find(testclass)); 
plot3(pp(l,:),  pp(2,:),  pp(3,:),  ’g+'); 
pp=ai(:,find(testclass==0)); 
plot3(pp(l,:),  pp(2,:),  pp(3,:),  'gx'); 

hold  off 

end  %if(dim— 3) 

end  %if(continue==3) 


if  (continue==4) 


ch=menu('Defuzzification',  '3class',  'Upper  thresh'.'Lower  thresh'); 
ifch==l,  classification=3,  end 


ifch=2 

ut=input('enter  the  upper  threshhold');  %  lower  limit  for  class  1 
end 

ifch—3 

lt=input('enter  the  lower  threshhold');  %upper  limit  for  class  0 
end 

if  classification=l 

cr  =flresult(2,:)  >  ut  %  defuzzification  of  the  result 

correct  =  100*(l-mean(abs(ttestclass-cr)))  %  percentage  correct  classified 

cc  =  [1  -ttestclass;  ttestclass] ;  %  adding  a  row  of  complements  to  c 

cc=ffresult-cc; 

'Performance  Index=' 
cc  =  sqrt(mean(mean(cc  .A  2))) 
end 

if  classification=2 

correct  =  100*(l-mean(abs(ttestclass-ccresult)))  %  percentage  correct  classified 
cc  =  sqrt(mean(abs(ttestclass-ccresult)))  %  performance  index 
end 

if  classification=3 

class  l=find(ffresult(2,:)  >ut); 
class0=find(ffresult(2,:)  <lt); 
class3  =find(ffresult(2, :)  >lt  &  ffresult(2,:)  <ut); 

fr=[ffresult(:,classl)  flresult(:,classO)]  %  the  section  that  is  classified  into  one  of 

the  two  classes 

cr=fr(2,:)>ut 

tr=[ttestclass(classl)  ttestclass(classO)]  %  the  section  that  is  classified  into  one  of 

the  two  classes 

percent j:lassified=100*((length(class0)+length(classl))/length(ttestclass)) 
correct  =  100*(l-mean(abs(tr-cr)»  %  percentage  correct  classified 
cc  =  [1-tr;  tr];  %  adding  a  row  of  complements  to  cc 
cc=fr-cc; 

'Performance  Index- 
cc  =  sqrt(mean(mean(cc  ,A  2))) 
end 
end 

end  %  while  continue  =  3  |  4 

end  %  while  continue 
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This  program  implements  a  K-nearest  neighbor  classifier, 
created  by:  Shahab  Layeghi 

created:  8/4/93 
last  modified:  9/17/93 


/*  The  main  program  opens  a  matlab  data  file,  reads  the  training  matrix, 
classifies  each  entry  in  the  testing  matrix,  and  writes  the  result  in  an 
output  file.  The  file  that  this  program  gets  the  information  from  should  be 
called  "cdatafil.mat".  As  the  name  implies  it  is  in  matlab  file  format. 

The  data  in  this  file  should  have  the  following  order: 

1.  A  single  variable  *C'  which  is  the  number  of  classes. 

2.  A  single  variable  'K'  which  is  the  parameter  'K'  in  K-NN  Algorithm. 

3.  A  trainig  matrix  'P'  which  contains  a  set  of  feature  vectors.  Each  vector 
is  in  a  column  of  the  matrix. 

4.  A  classes  vector  T  which  contains  the  classes  of  the  training  set 

5.  An  input  matrix  'IT  which  contains  a  set  of  unclassified  feature  vectors. 

The  main  program  uses  the  CrispKNN  routine  to  classify  each  one  of  the  input 
vectors  and  saves  the  results  (the  classes  that  these  inputs  belong  to)  in  a 
file  called  coutfile.mat.  This  file  is  in  Matlab  format.  This  file  contains 
a  vector  of  the  classes  called: 

'cresult' 


This  program  can  be  called  from  dos,  or  within  Matlab  by  using  dos  escpae 
character '!'.  An  example  Matlab  script  file  that  shows  how  this  program  can 
be  used  is  included  in  the  file  "cknntest.m". 

*1 

#include  <stdio.h> 

#include  <stdlib.h> 

#include  <time.h> 

#include  <math.h> 

#include  <conio.h> 

#define  INPUTFILE  "cdatafil.mat" 

#define  OUTPUTFILE  "coutfile.mat" 

//  Function  Prototypes - 

int  CrispKNN(double  *Input,  double  *Samples,  double  Tables); 
double  FindDistance(double  *vecl,  double  *vec2); 
double  Maxd(double  *vec,  int  *index,  int  Length); 
int  FindMax(int  *vector,  int  *  count,  int  Length,  int  Max); 
int  loadmat(FILE  *  fp.int  *type,  char  *pname,  int  *mrows,  int  *ncols, 
int  *imagf,  double  **preal,  double  **pimag); 
void  savemat(FILE  *fp,  int  type,  char  *pname,  int  mrows,  int  ncols, 
int  imagf,  double  *preal,  double  *pimag); 


//  Global  variables,  these  variables  will  be  set  by  reading  matlab  file 

int  classes;  /*  the  number  of  classes  */ 

int  features;  /*  Number  of  features  in  a  class  */ 


int  KK ; 
int  SampleSize; 
int  TestSize; 


/*  K  in  K-nearest  neighbors  */ 

/*  Number  of  Labled  Samples  */ 


/* _ *1 

void  main() 

{ 


double  *Lables; 
double  *KP; 
double  *input; 
int  i  j; 

FILE  *fjp; 
char  name[20]; 
int  type,  imagf; 

double  *  Samples,  *isamples;  //  isamples  is  for  imaginary  part  of  the  matrix  that  is  not  used  in 

here. 

double  *Testdata; 
double  ^result; 
fp==fopen(INPUTFILE,"rb"); 

if(!fp)  { 

printf("cannot  open  the  file"); 
exit(-l); 

} 

//  read  classes  from  the  file 

loadmat(fp,  &type,  name,  &i,  &j,  &imagf,  &KP,  &isamples); 
if(i!=l  ||  j!=l)  { 

printfC'error:  You  should  include  classes  at  the  beginning  of  the  file\n"); 
exit(-l); 

} 

classes=*KP; 

//  read  KK  from  the  file 

loadmat(fp,  &type,  name,  &i,  &j,  &imagf,  &KP,  &isamples); 
if(i!=l  ||j!=l){ 

printf("error:  You  should  include  K  at  the  beginning  of  the  file\n"); 
exit(-l); 

} 

KK=*KP; 

//  read  the  matrix  from  the  datafile. 

loadmat(fp,  &type,  name,  &features,  &SampleSize,  &imagf,  &Samples,  &isamples); 

//  reading  tables  from  data  file 

loadmat(fp,  &type,  name,  & i,  &j,  &imagf,  &Lables,  &isamples); 
if(i!=l  ||  j!=SampleSize)  { 

printfC'error:  Number  of  labels  is  different  from  the  number  of  samples'^"); 
exit(-l); 
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//  read  data  to  be  classified  from  the  file 

loadmat(fp,  &type,  name,  &i,  &TestSize,  &imagf,  &Testdata,  &isamples); 
if(i  !=  features)  { 

printf("error:  Training  and  testing  matrices  should  have  the  same  size"); 
exit(-l); 

} 

//  Allocate  space  for  result  vector 

result  =  (double  *)  malloc(TestSize*sizeof(double)); 
if(!result)  { 

printf("Error:  cannot  allocate  memory  for  the  result  vector"); 
exit(-l); 

} 

for(i=0;  i<TestSize;  i++)  {  //  for  each  input 

input=Testdata+i*features; 
result[i]=CrispKNN(input,  Samples,  Lables); 

//  printf("class:  %lf\n",  result[i]); 

} 

fclose(fp); 

//  printf("\n  End  of  classification.  Now  writing  the  result  in  the  file"); 
fp=fopen(OUTPUTFILE,  "wb"); 

if(!fp)  { 

printf("Error:  Cannot  write  the  file"); 
getch(); 

} 

savemat(fp,  0,  "cresult",  1,  TestSize,  0,  result,  result); 
fclose(fp); 

> 

/* - */ 

int  CrispKNN(double  *  Input,  double  "“Samples,  double  "“Lables) 

{ 

int  i j  ; 
int  nj,  k,  nk; 
double  "“distance; 
int  "“index; 
double  x,y; 

distance  =  (double  *)  malloc(KK*sizeof(double)); 
if(!distance)  { 

printf("Error:  Not  enough  memory  for  distance  vector"); 
exit(-l); 

} 

index  =  (int  *)  malloc(KK*sizeof(int)); 
if(!index)  { 

printf("Error:  Not  enough  memory  for  index  vector"); 
exit(-l); 

} 
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for(i=0;  i<KK;  i++)  {  //  This  loop  initializes  K  nearest  neighbors  to  the  first  K  Samples 
index[i]=Lables[i]+l ; 

distance[i]=FindDistance(Input,&Samples[i*features]); 

} 

for(i=KK;  i<SampleSize;  i++)  {  //  This  is  the  loop  that  finds  the  K  nearest  Neighbors 

x=Maxd(distance,  &j,  KK); 
y=FindDistance(Input,&Samples[i*features]); 

if(y  <  x)  {  //  This  sample  is  closest  to  the  input  than  the  farthest  K  Neighbors 
distance  [j]=y; 
index[j]=Lables[i]+ 1 ; 

} 

} 

j=FindMax(index,  &nj,  KK,  classes);  //  Finds  the  class  of  maximum  occurance 

/*  In  this  section  it  is  checked  to  see  if  there  is  a  tie.  That  is  if 
there  are  two  or  more  classes  with  the  same  number  of  occureances.  If 
there  is  a  tie  for  two  classes,  the  class  with  the  minimum  sum  of 
distances  is  selected.  No  action  is  taken  for  a  tie  of  more  than  two 
classes.  */ 

for  (i=0;  i<KK;  i++) 

if(index[i]=j)  index[i]=0; 
k=FindMax(index,  &rik,  KK,  classes); 
if(nk=nj)  {  //If  there  is  a  tie. 

x=0; 

for(i=0;  i<KK;  i++)  { 
if(index[i]=0) 

x+=distance[i]; 

} 

y=0; 

for(i=0;  i<KK;  i++)  { 
if(index[i]=k) 

y+=distance[i]; 

} 

if(y<x)  //If  sum  of  the  distances  to  class  j  is 

less  than  that  of  class  k 

j=k; 

} 

free(distance); 
free(index); 
return  j-1; 


> 

/*  This  function  returns  the  Euclidian  distance  between  two  vectors  */ 

double  FindDistance(double  *vecl,  double  *vec2) 

{ 


int  k; 

double  distance; 
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distance  =  0; 

for(k=0;  k<features;  k++)  { 

distance  +=(vecl  [k]-vec2[k])*(vecl  [k]-vec2[k]); 

I  I  distance  +=  pow(vecl  [k]-vec2[k] ,  2); 

} 

return  distance; 

> 

/*  This  function  finds  the  biggest  element  of  an  array.  It  returns  that 
value  and  also  returns  the  index  to  that  element  in  index. 

*/ 

double  Maxd(double  *vec,  int  *index,  int  Length) 

{ 

int  ij=0; 
j=°; 

for(i=l;  i<Length;  i++) 

if(vec[i]>vec[j])j=i; 

*index=j; 

return(vec[j]); 

> 

/* _ */ 

/*  This  function  finds  a  number  that  is  most  often  repeated  in  an  array  of 
integer  values,  and  returns  that  number.  Length  of  array  shoud  be  less  than 
100.  It  is  supposed  that  number  is  an  integer  greater  than  zero, 
vector  is  a  pointer  to  the  array,  count  is  the  number  of  times  that  the 
number  is  repeated.  Length  is  the  length  of  the  vector. 

*/ 

int  FindMax(int  "“vector,  int  "‘count,  int  Length,  int  Max) 

{ 

int  i,  j,  m; 
intt[101]; 

if(Max>100)  Max=100; 
for(i=0;  i<Max+l;  i++) 
t[i]=0; 

for(i=0;  i<Length;  i++) 
t[vector[i]]++; 

m=t[l]; 

j=i; 

for(i=l;  i<Max+l;  i++)  { 
if(t[i]>m)  { 

m=t[i]; 

j=i; 

} 

} 

*count=m; 
return  0);  } 
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/*  This  program  implements  a  fuzzy  version  of  K-nearest  neighbor  classifier, 
created  by:  Shahab  Layeghi 

created:  9/1/93 
last  modified:  9/3/93 

*/ 

/*  The  main  program  opens  a  matlab  data  file,  reads  the  training  matrix, 
classifies  each  entry  in  the  testing  matrix,  and  writes  the  result  in  an 
output  file.  The  file  that  this  program  gets  the  information  from  should  be 
called  "fdatafile.mat”.  As  the  name  implies  it  is  in  matlab  file  format. 

The  data  in  this  file  should  have  the  following  order: 

1 .  A  single  variable  'C'  which  is  the  number  of  classes. 

2.  A  single  variable  ’K’  which  is  the  parameter  'K'  in  K-NN  Algorithm. 

3.  A  single  variable  'M'  which  is  the  coefficient  in  fuzzy  algorithm. 

4.  A  trainig  matrix  'P'  which  contains  a  set  of  feature  vectors.  Each  vector 
is  in  a  column  of  the  matrix. 

5.  A  class  membership  matrix  T  which  contains  the  membership  values  of  the 
training  set  vectors  to  the  classes. 

6.  An  input  matrix  *U'  which  contains  a  set  of  unclassified  feature  vectors. 

The  main  program  uses  the  FuzzyKNN  routine  to  classify  each  one  of  the  input 
vectors  and  saves  the  results  (the  classes  that  these  inputs  belong  to)  in  a 
file  called  "foutfile.mat".  This  file  is  in  Matlab  format.  This  file  contains 
a  single  variable  called  fiesult.  It  is  a  vector  of  the  classes. 

This  program  can  be  called  from  dos,  or  within  Matlab  by  using  dos  escpae 
character  ’!'.  An  example  Matlab  script  file  that  shows  how  this  program  can 
be  used  is  included  in  the  file  "fknntest.m". 

*/ 

#include  <stdio.h> 

#include  <stdlib.h> 

#include  <time.h> 

#include  <math.h> 

#include  <conio.h> 

^define  INPUTFILE  "fdatafil.mat" 

#define  OUTPUTFILE  "foutfile.mat" 

//  Function  Prototypes - 

void  FuzzyKNN(double  *Input,  double  *Samples,  double  *Lables,  double  *Result); 
double  FindDistance(double  *vecl,  double  *vec2); 
double  Maxd(double  *vec,  int  *  index,  int  Length); 
int  FindMax(int  ^vector,  int  ♦count,  int  Length,  int  Max); 
int  loadmat(FILE  *  fp.int  *type,  char  *pname,  int  *mrows,  int  *ncols, 
int  *imagf,  double  **preal,  double  **pimag); 
void  savemat(FILE  *fp,  int  fype,  char  *pname,  int  mrows,  int  ncols, 
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1 1  read  M  from  the  file 

loadmat(fp,  &type,  name,  &i,  &j,  &imagf,  &KP,  &isamples); 

^  ^  printfC'error:  You  should  include  M  as  the  thrid  parameterin"); 

exit(-l); 

} 

M=*KP; 


// read  the  matrix  from  the  datafile. 

loadmat(fp,  &type,  name,  &features,  &SampleSize,  &imagf,  &Samples,  &isamples). 


//  reading  lables  from  data  file 

loadmat(fp,  &type,  name,  & i,  &j,  &imagf,  &Lables,  &isamples), 

if(i!=l  ||j!=SampleSize)  {  ,  , 

printfC'error:  Number  of  labels  is  different  from  the  number  of  samples\n  ); 

exit(-l); 

} 


//read  data  to  be  classified  from  the  file 

loadmat(fp,  &type,  name,  &i,  &TestSize,  &imagf,  &Testdata,  &isamples); 

if(i  !=  features)  {  . 

printfC'error:  Training  and  testing  matrices  should  have  the  same  size  ), 

exit(-l); 

} 


//  Allocate  space  for  result  vector 


result  =  (double  *)  malloc(TestSize*Classes*sizeof(double)); 
if(!result)  { 

printfC'Error:  cannot  allocate  memory  for  the  result  Matrix  ); 
exit(-l); 

> 


// 


// 


for(j=0;  j<TestSize;  j++)  {  //  for  each  input 

input=Testdata+j*features; 
FuzzyKNN(input,  Samples,  Lables,  iresult); 
printf("\n  Memberships:''); 
for(i=0;  i<Classes;  i++)  { 

result  0  *  Classes+i] =iresult[i] ; 
printf("  %lf ",  iresult[i]); 

} 


} 

fclose(fp);  .  , 

printf("\n  End  of  classification,  Now  writing  the  result  in  the  file  ); 


fp=fopen(OUTPUTFlLE.  "wb"); 

if(!fp)  ( 

printf("Error:  Cannot  write  the  file''); 
getch(); 

savemat(fp,  0,  "fresult".  Classes,  TestSize,  0,  result,  result); 
fclose(fp); 
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} 


/* - */ 

/*  This  is  a  fuzzy  K  Nearest  neighbor  classifier  routine.  Input  is  the 
vector  to  be  classified,  Samples  is  the  matrix  of  classified  samples, 

Lables  is  the  vector  of  the  classes  that  these  samples  belong  to. 

Result  is  the  vector  of  membership  values  of  Input  to  each  class. 

*/ 

void  FuzzyKNN(double  *Input,  double  *Samples,  double  *  Lables,  double  *Result) 

{ 

int  ij,n ; 
int  nj,  k,  nk; 
double  *distance; 
int  *index; 
double  x,y; 

double  "“membership;  //  pointer  to  membership  matrix 

double  nsum,  dsum,  temp; 

/*  This  section  builds  a  fuzzy  membership  matrix  from  the  lables. 

Membership  of  each  sample  to  the  class  that  it  belongs  to  is  assigned 
to  1,  and  the  membership  of  it  to  other  classes  is  assigned  to  0  */ 

membership  =  (double  *)  malloc(SampleSize*Classes*sizeof(double)); 
if(!membership)  { 

printf("Error:  Not  enough  memory  for  membership  matrix"); 
exit(-l); 

} 

for(i=0;  i<SampleSize*CIasses;  i++) 

*(membership+i)=0;  //  Initializing  matrix  to  zero 

for(j=0;  j<SampleSize;  j++)  { 
i=*(Lables+j); 

*(membership+i*SampleSize+j)=  1 ; 

} 

distance  =  (double  *)  malloc(KK*sizeof(double));  //  allocate  space  for  the  vector 
if(!distance)  { 

printf("Error:  Not  enough  memory  for  distance  vector"); 
exit(-l); 

} 

index  =  (int  *)  malloc(KK*sizeof(int)); 
if(!index)  { 

printf("Error:  Not  enough  memory  for  index  vector"); 
exit(-l); 

} 

for(i=0;  i<KK;  i++)  {  //  This  loop  initializes  K  nearest  neighbors  to  the  first  K  Samples 
index[i]=i; 

distance[i]=FindDistance(Input,  &Samples[i*features]); 

> 

for(i=KK;  i<SampleSize;  i++)  {  //  This  is  the  loop  that  finds  the  K  nearest  Neighbors 
x=Maxd(distance,  &j,  KK); 
y=FindDistance(Input,  &Samples[i*features]); 

if(y  <  x)  {  //  This  sample  is  closest  to  the  input  than  the  farthest  K  Neighbors 
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distance[i]=y; 

index[j]=i; 

} 

} 

for(j=0;  j<Classes;  j++)  { 
nsum=dsum=0; 
for(n=0;  n<KK;  n++)  { 
i=index[n]; 

temp=FindDistance(Input,  &Samples[i*features]); 
if(temp  <  le-10)  { 

zero 

Result[j]=membership[j*SampleSize+i]; 

break; 

} 

if(M  =  2) 

temp=l/temp; 
else  if(M  !=  1) 

temp=pow(  1/temp,  1/(M-1)); 
else 

temp=0; 

nsum  +=  membership[j*SampleSize+i]*temp; 

dsum  +=  temp; 

} 

if(dsum  !=0) 

Result[j]=nsum  /  dsum; 

> 

free(membership); 

free(distance); 

free(index); 

} 


/*  This  function  returns  the  Euclidian  distance  between  two  vectors  */ 

double  FindDistance(double  *vecl,  double  *vec2) 

{ 

intk; 

double  distance; 
distance  =  0; 

for(k=0;  k<features;  k++)  { 

distance  +=  (vecl[k]-vec2tk])*(vecl[k]-vec2[k]); 
//  distance  +=  pow(vecl[k]-vec2[k] ,  2); 

> 

return  distance; 

} 

_ _ *1 

I*  This  function  finds  the  biggest  element  of  an  array.  It  returns  that 
value  and  also  returns  the  index  to  that  element  in  index. 

*/ 


//If  distance  is 
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double  Maxd(double  *vec,  int  *index,  int  Length) 

{ 

int  i  j=0; 
j=0; 

for(i=l;  i<Length;  i++) 

if(vec[i]>vec[j])  j=i; 

*index=j; 

retum(vec[j]); 

} 


/*  This  function  finds  a  number  that  is  most  often  repeated  in  an  array  of 
integer  values,  and  returns  that  number.  Length  of  array  shoud  be  less  than 
100.  It  is  supposed  that  number  is  an  integer  greater  than  zero, 
vector  is  a  pointer  to  the  array,  count  is  the  number  of  times  that  the 
number  is  repeated.  Length  is  the  length  of  the  vector. 

*/ 

int  FindMax(int  ^vector,  int  *count,  int  Length,  int  Max) 

{ 


inti,j,  m; 
intt[101]; 

if(Max>100)  Max=100; 
for(i=0;  i<Max+l;  i++) 
t[i]=0; 

for(i=0;  i<Length;  i++) 
t[vector[i]]++; 

m=t[l]; 

j=i; 

for(i=l;  i<Max+l;  i++)  { 
if(t[i]>m)  { 

m=t[i]; 

j=U 

} 

} 

*count=m; 
return  (j); 


> 
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§2.  INTRODUCTION 


2.1.  POLYGRAPH1 


2.7.7.  Preview: 


Polygraph  examinations  are  the  most  widely  used  method  to  distinguish  between  truth  and 
deception.  In  a  Polygraph  examination  a  person  is  connected  to  a  special  instrument  called 
a  Polygraph  which  records  several  physiological  signals  such  as  blood  pressure,  Galvanic 
Skin  Response,  and  respiration.  The  subject  is  asked  a  set  of  questions  by  an  examiner.  By 
looking  at  these  signals  the  examiner  is  able  to  determine  the  reactions  of  the  subject  to 
the  questions  and  decide  whether  the  person  was  truthful  or  deceptive  in  answering  each 
question. 

The  problem  with  human  classification  of  Polygraph  tests  is  that  the  outcome  depends  on 
the  examiner's  experience  and  personal  opinion.  Automatic  scoring  of  Polygraph  tests  has 
been  a  subject  of  extensive  research.  Several  methods  for  Polygraph  classification  have 
been  studied  which  are  mostly  based  on  statistical  classification  techniques. 

Digitized  Polygraph  data  used  in  this  project  were  collected  from  various  police  stations. 
The  data  files  were  organized  according  to  the  test  format  used  and  were  decoded  to 
ASCII  format  so  they  can  be  read  by  Matlab.  Preprocessing  and  feature  extraction 
routines  were  implemented  in  the  Matlab  language  in  privious  works  [Layeghi  1993,1] 
[Dastmalchil993][Jacobsl993].  Three  sets  of  files  were  chosen,  each  one  of  them 
contained  50  deceptive  and  50  non-deceptive  files. 

These  files  are  listed  in  the  appendix,  Fig.42. 

2.7.2.  History: 

The  first  attempt  to  use  a  scientific  instrument  in  an  effort  to  detect  deception  occurred 
around  1 895  [Reid 1966],  That  was  the  year  that  Caesar  Lombroso  published  the  results  of 
his  experiments  in  which  a  hydro sphygmograph  was  used  to  measure  the  blood  pressure- 
pulse  changes  of  criminals  in  order  to  determine  whether  or  not  they  were  deceptive. 
Although  the  hydrosphygmograph  was  originally  intended  to  be  used  for  medical 


•Portions  of  this  section  were  extracted  from  [Layeghi  1993,1]  using  particularly  [Cappsl992]  [01senl983] 
[Reid 1966], 
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purposes,  Lombroso  found  that  it  worked  well  for  lie  detection.  Lombroso  may  have  been 
the  first  to  use  a  peak  of  tension  test  format.  This  was  done  by  showing  a  suspect  a  series 
of  photographs  of  children,  one  being  the  victim  of  sexual  assault.  If  the  suspect  did  not 
react  more  to  the  victims  picture  than  the  pictures  of  the  other  children,  Lombroso 
concluded  that  the  suspect  did  not  know  what  the  victim  looked  like  and  therefore  was  not 
the  alleged  perpetrator. 

In  1914  Vittorio  Benussi  published  his  research  on  predicting  deception  by  measuring 
recorded  respiration  tracings  [Capps 1992].  He  found  that  if  the  length  of  inspiration  were 
divide  by  the  length  of  expiration,  the  ratio  would  be  larger  after  lying  than  before  lying 
and  also  before  telling  the  truth  than  after  telling  the  truth.  In  1921  John  A.  Larson 
constructed  an  instrument  capable  of  simultaneously  recording  blood  pressure  pulse  and 
respiration  during  an  examination  [Reid  1966]  [Capps  1992],  Larson  reported  accurate 
results  which  prompted  Leonarde  Keeler  to  construct  a  better  version  of  this  instrument  in 
1926  [Reid  1966]  [Cappsl992], 

The  use  of  galvanic  skin  response  in  lie  detection  began  during  the  turn  of  the  century.  It's 
usefulness,  however,  did  not  become  evident  until  the  1930's  during  which  time  several 
articles  written  by  Father  Walter  G.  Summers  of  Fordham  University  in  New  York 
[Cappsl992],  In  these  articles  he  reports  over  90  criminal  cases  in  which  examination 
using  the  galvanic  skin  response  had  all  been  successful  and  confirmed  by  confession  or 
supplementary  evidence. 

The  usefulness  of  the  galvanic  skin  response  prompted  Keeler  to  add  an  galvanometer  to 
his  polygraph.  At  the  time  of  Keelers  death  in  1949,  the  Keeler  Polygraph  recorded  blood 
pressure-pulse,  respiration,  and  galvanic  skin  response  [Reid  1966], 


2./.  3.  Modem  Test  Formats: 


The  effectiveness  of  a  polygraph  examination  is  often  the  result  of  the  test  format  that  is 
used.  A  polygraph  test  format  consists  of  an  ordered  combination  of  relevant  questions 
about  an  issue,  control  questions  that  provide  a  physical  response  for  comparison,  and 
irrelevant  questions  that  also  provide  a  response  or  the  lack  of  a  response  for  comparison 
[01senl983][Cappsl992], 
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Three  general  types  of  test  formats  are  in  use  today.  These  are  Control  Question  Tests, 
Relevant-Irrelevant  Tests,  and  Concealed  Knowledge  Tests.  Each  of  the  general  test 
formats  may  have  a  number  of  more  specific  variations.  Each  examination  consists  of  two 
to  five  sessions  containing  a  prescribed  series  of  questions.  The  test  format  that  is  used  in 
an  examination  is  determined  by  the  test  objective  [Reidl966]  [Cappsl992]. 

1.  The  Concealed  Knowledge  Test,  also  called  peak  of  tension  test,  is  used  when  facts 
about  a  crime  are  known  only  by  the  investigators  and  not  by  the  public.  In  this  case,  a 
subject  would  not  know  the  facts  unless  he  or  she  was  guilty  of  the  crime.  For  example,  if 
a  gun  was  used  in  a  crime  and  the  public  did  not  know  the  caliber,  an  examiner  could  ask  a 
suspect,  if  it  was  a  22  caliber,  a  38  caliber,  or  a  9  mm.  If  the  gun  used  was  a  9  mm  and  the 
suspect  was  deceptive,  a  polygraph  chart  would  probably  indicate  evidence  of  deception. 

2.  A  Control  Question  Test2  is  often  used  in  criminal  investigations.  In  this  type  of  test  a 
series  of  relevant,  irrelevant,  and  control  questions  are  asked: 

•  A  relevant  question  is  one  which  is  specific  to  the  crime  being  investigated. 

For  example,  "Did  you  steal  the  money?". 

•  A  control  question  is  designed  to  make  the  subject  feel  uncomfortable.  It 
is  not  specific  to  the  crime  being  investigated  however  it  may  be  related  in 
an  indirect  way.  A  control  question  that  could  follow  the  relevant  question 
stated  above  is  "Have  you  ever  taken  anything  that  did  not  belong  to  you?". 

The  control  questions  are  compared  to  the  relevant  questions  and  if  the 
responses  to  the  relevant  questions  are  greater,  the  subject  is  usually  classified 
as  deceptive. 

•  Irrelevant  questions  are  used  as  buffers.  Examples  of  irrelevant  questions  are 
"Are  the  lights  in  this  room  on?"  or  "Is  today  Monday?". 

3.  Relevant-Irrelevant  Tests  are  usually  used  to  test  people  trying  to  obtain  security 
clearance  or  get  a  job.  In  this  test,  relevant  questions  are  compared  to  irrelevant  questions. 
Very  few  control  questions  are  asked.  The  purpose  of  control  questions  in  this  test  is  to 
make  sure  that  the  subject  is  capable  of  reacting  at  all. 


2  It  was  decided  to  use  this  method  in  our  project  (as  it  was  also  in  previous  works). 
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2. 1. 4.  Present  Day  Equipment 


The  most  popular  polygraph  machines  today  are  the  Reid  Polygraph  developed  in  1945 
and  the  Axciton  Systems  computerized  polygraph  developed  in  1989  [01senl983],  The 
Reid  polygraph  scrolls  a  piece  of  paper  under  pens  that  record  the  biological  signals.  The 
Axciton  polygraph  digitizes  physiological  signals  and  uses  a  computer  to  process  them. 
The  sampling  frequency  of  the  Axciton  machine  is  30  Hz.  Axciton  provides  a  computer 
based  system  for  ranking  the  subject  responses  but  allows  printouts  of  the  charts  to  be 
scored  by  hand  the  traditional  way. 

Both  machines  record  the  same  biological  signals  using  standard  methods.  Blood  pressure 
is  measured  by  placing  a  standard  blood  pressure  cuff  on  the  arm  over  the  brachial  artery. 
Respiration  is  monitored  by  placing  rubber  tubes  around  the  abdominal  area  and  the  chest 
of  the  subject.  This  results  in  two  signals,  a  lower  and  upper  respiratory  signal.  Skin 
conductivity  is  measured  by  placing  electrodes  on  two  fingers  of  the  same  hand. 

The  focus  of  this  thesis  is  to  investigate  two  different  fuzzy  pattern  recognition  algorithms 
using  the  aforementioned  signals. 
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2.2.  PATTERN  RECOGNITION  UTILIZING  FUZZY  TOOLS 

2.2.1.  Whv  the  "FUZZY"  approach? 

While  observing  the  history  of  science,  we  notice  that  one  of  its  major  goals  has  always 
been  what  we  call  today  "pattern  recognition".  Having  this  in  mind,  man  created  models, 
functional  relationships  and  mathematical  tools  to  come  closer  to  a  perfect  and  precise 
model  for  almost  every  area  of  the  nature  and  our  being.  In  fact,  "precision"  became  more 
and  more  important,  to  the  extent  that  an  imprecise  model  was  a  bad  model  by  default. 

1965  Lotfi  A.  Zadeh  introduced  in  his  innovative  paper  [Zadehl965]  an  " imprecise " 
structure  for  mathematical  observation;  Hence,  the  fuzzy  set  was  bom.  A  companion  to 
the  classical  one  with  often  more  useful  and  suitable  representation  of  our  environment. 

"The  fuzzy  set  was  conceived  as  a  result  of  an  attempt  to  come  to  grips  with  the  problem 
of  pattern  recognition  in  the  context  of  imprecisely  defined  categories.  In  such  cases,  the 
belonging  of  an  object  to  a  class  is  a  matter  of  degree,  as  is  the  question  of  whether  or  not 
a  group  of  objects  form  a  cluster";  These  were  the  introductory  words  from  L.A.  Zadeh  in 
[Bezdekl981],  They  summarize  the  fundament  of  any  fuzzy  clustering  or  classifying 
algorithm  concerning  any  search  of  data  structure  or  pattern  recognition.  This  concept  is 
exactly  what  this  project  is  all  about. 

An  example: 

Imagine,  you  have  two  groups  of  objects  "chairs"  and  "desks"  in  different  varieties.  In  a 
simple  version  of  a  typical  pattern  recognition  problem,  you  have  the  task  to  cluster  or 
classify  the  given  objects  into  these  two  groups.  In  reality,  we  will  also  have  other  objects 
like  a  big  box  or  a  bed  within  the  pool  of  the  objects,  but  only  the  two  aforementioned 
clusters  by  definition.  Now,  a  conventional  crisp  clustering  method  would  put  these 
critical  objects  in  either  one  of  these  two  clusters.  Thus,  the  big  box  or  the  bed  may  be 
labeled  as  if  they  would  be  chairs. 

A  fuzzy  clustering  method  would  label  the  objects  with  soft  membership  values.  In  this 
case,  a  big  box  (that  can  be  used  as  a  chair  or  a  desk)  might  be  labeled  with  0.6  degree 
chair  and  0.4  desk.  Information  like  this  serves  a  useful  purpose  -  "fuzzy  memberships  in 
several  classes  are  a  signal  to  take  a  second  look"  [Bezdekl993]  [Bezdekl992]: 
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Hard  memberships  of  data  cannot  support  this.  Thus,  the  fuzzy  model  provides  a  richer 
and  more  flexible  solution  structure,  one  that  models  the  real  objects  with  a  finer  degree  of 
detail  than  the  harshness  of  the  crisp  models.  Notice  also  that  hard  membership  values 
build  a  subset  of  the  fuzzy  membership3  set. 

There  are  different  types  of  fuzzy  algorithms  to  find  the  appropriate  membership  values 
within  the  data.  In  this  project,  we  used  the  follwoing  two  approaches: 

1.  Clustering  algorithms: 

Given  any  finite  data,  the  problem  of  clustering  is  to  find  similarities  between  the  objects 
of  the  data  and  to  assign  labels  that  matching  objects  would  belong  to  the  same  subgroups. 
The  algorithm  starts  its  search  without  any  initial  interpretative  information  about  the  data 
elements.  It  only  seeks  for  objective  numerical  similarities  between  the  elements.  Because 
the  initial  objects  are  unlabeled,  this  method  is  often  called  "unsupervised  learning".  The 
word  learning*  implies  that  the  clustering  algorithm  will  ultimately  find  the  correct  labels 
at  the  end  of  the  process.  This  is  what  we  hope  to  obtain,  but  we  do  not  know  it  a  priori. 

Notice  that  because  of  the  unsupervised  nature  of  this  algorithm,  we  may  find  "correct" 
clusters  which  represent  some  similarities,  but  not  the  ones  we  were  looking  for.  In  the 
aforementioned  example  with  chairs  and  desks,  the  algorithm  may  provide  two  clusters  of 
"wood-made"  and  "metal-made"  objects  (which  are  also  correct),  but  not  "chairs"  and 
"desks"  as  we  had  hoped  for. 

In  this  case,  the  performance  of  a  clustering  model  is  influenced  by  the  choice  of  the 
parameters5,  features,  geometrical  properties  and  our  eventual  interpretation  of  the  labels. 

2.  Classifying  algorithms: 

In  contrast  to  a  clustering  system  which  labels  a  given  data,  a  classifier  is  capable  -  once  it 
is  defined  (and  trained)  -  of  labelling  every  appropriate  data.  In  addition,  a  classifying 
system  is  ususally  initialized  by  labeled  objects.  In  these  cases,  we  call  this  method 
"supervised  learning". 


3Notice  that  membership  values  are  not  probabilities;  they  are  similarities  of  object  vectors  to  a  class 
structure.  They  represent  the  degree  of  belonging  of  an  object  to  a  group  of  objects. 

4The  word  learning  does  not  imply  any  training.  In  fact,  a  clustering  system  -  as  is  its  nature  -  is  almost 
the  opposite  of  any  system  which  learns  by  training. 

5See  chapter  2.2.3.2.  for  the  meanings  of  the  parameters  and  chapter  3. 1.3. 3.  for  the  strategies  we  used. 
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Notice  that  we  can  also  use  a  clustering  algorithm  as  a  modified  classifying  algorithm: 

After  having  set  the  optimal  combination  of  parameters  and  features,  we  can  use  the 
clustering  system  to  classify  any  new  data  by: 

•  adding  the  new  element  to  a  given  and  already  correct  clustered  data,  and  letting 
the  system  relabel6  the  data.  Thus,  our  new  object  ends  up  to  be  in  one  of  the 
clusters  representing  its  identity, 

•  saving  all  the  parameters,  cluster  centers  and  the  data  elements  and  calculate 
appropriately  the  membership  value  of  the  new  object,  which  will  eventually 
represent  its  identity. 


^Running  a  new  clustering  process  with  one  more  element  will  probably  change  the  structure  of  the 
original  clusters,  because  the  cluster  centers  and  the  membership  values  of  each  element  depend  on  all  of 
the  members.  In  spite  of  this  fact,  we  will  be  able  to  classify  a  normal  (  -  not  an  outlier )  object  by  having 
a  large  number  of  already  clustered  objects  in  a  stable  condition. 
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2.2.2.  Whv  fuzzv-c-means  (FCM)? 


One  of  the  most  significant  characteristic  of  /wzzy-c-means  algorithm  is  its  "fuzziness"7,  as 
the  name  assumes.  Unlike  crisp  clustering  methods,  FCM  gives  us  "membership  functions" 
c  [0,  1]  which  determine  the  grade  of  belongingness  of  the  elements  to  a  cluster.  As 
mentioned  before,  this  information  is  totally  lost  by  conventional  clustering  techniques. 
The  advantage  of  FCM  is  the  fact  that  the  results  we  may  get  from  a  crisp  clustering 
method  are  automatically  within  those  from  FCM. 

We  chose  FCM  as  an  alternative  and  a  comparison  to  the  fuzzy  K-Nearest-Neighbor 
algorithm  (KNN)  investigated  previously  [Layeghil993,l][Dastmalchil993][Jacobsl993], 
specially  because  FCM  is  an  unsupervised  clustering  method  which  works  only  by  using 
"mathematical"  tools  such  as  spatial  distances  or  similarities,  without  any  training  or 
additional  interpretative  information. 

By  this  method,  good8  features  will  then  hopefully  provide  an  optimal  mathematical 
grouping  that  presents  in  some  sense  an  accurate  portrayal  of  natural  structures  in  the 
physical  process  from  where  the  polygraph  data  are  drived. 


Whv  we  chose  FCM  algorithm: 

Because  it 

• 

does  not  need  previous  training, 

• 

does  not  make  any  assumption  about 
the  distribution  of  samples, 

• 

is  unsupervised,  objective  and  self  organized, 

• 

can  be  used  as  an  alternative  and  a  comparison 
to  fuzzv  KNN  investigated  previously. 

i  - — - — 

Fig.l:  FCM  characteristics 

7See  chapter  2. 1. 1.  for  characteristics  of  a  fuzzy  approach. 

8"Good"  features  are  in  our  study  those  which  can  cluster  the  data  in  deceptive  and  truthful  groups. 
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2.2.3.  Fuzzv-c-means  algorithm  and  its  interpretation 


2.2.3./.  FCM  code  -An  iterative  procedure: 


The  fuzzy-c-means  algorithm9  is  basically  an  iterative  procedure  to  minimize  an  objective 
function  Jm  representing  a  spatial  fuzzy  distance  between  data  points  xk  and  cluster 
centers  vf .  In  this  project,  I  chose  the  most  widely  used  Euclidean  distance,  i.e.  the  sum  of 
the  squared  errors  performance  index; 


n  c 


h~vi 

k=l  Z=1 


2 

A 


•  X  =  {xltx2  }  c  91s  is  a  finite  data  set  in  the  pattern  space  91s. 

•  c  is  a  fixed  and  known  number  of  clusters  (here:  c=2). 

•  U  =  [w,J  e  91 c"  is  a  fuzzy  c-partition  of  X,  w*  is  referred  to  as  the  grade  of  membership 
of  xk  to  the  cluster  i.  uik  satisfy  the  following  constraints; 

uik  e[0,l];l  <,i  <c,l  <  k  <n 

C 

=l;l<k<n 

n 

0<^uik  <n;\<i<c 
*= i 


•  v  =  (  Vj,v2,...,vc  )  €91“  ;  each  v/  €9t5  represents  a  prototype  of  class  i. 

•  m  is  the  weighting  exponent  and  represents  the  level  of  fuzziness;  1  <  m  <  oo. 


9[Ruspinil969]  was  the  first  one  who  suggested  the  structure  of  fuzzy-c-partition  spaces.  The  fuzzy-c- 
means  algorithm  (originally  ISODATA)  was  initially  developed  by  [Dunnl974]  and  generalized  by 
[Bezdekl973], 

Dunn  extended  and  developed  the  classical  "within-groups  sum  of  the  squared  errors"  (WGSS)  function  to 
a  fuzzy  clustering  criterion  and  developed  the  fuzzy-c-means  clustering  algorithm  to  minimize  the 
objective  function  through  an  iterative  method.  Bezdek  further  extended  the  fuzzy  objective  function 
proposed  by  Dunn  to  a  more  genral  form  of  fuzzy  clustering  criterion  by  introducing  the  weighting 
exponent  m,  1  <  m  <  ».  It  turns  out  that  Dunn's  function  is  a  special  case  (  m=2  )  of  an  infinite  family  of 
objective  functions. 


4-14 


•  1  xk  ~  vi  lCi  is  an  inner  product  induced  norm  on 


By  differentiation 
fixed,  we  obtain 


Jm(U,v)  with  respect  to  uik  where  v{  is  fixed  and  to  v;  where  U  is 


uik  = 


I 

;= i 


k-v| 


Xk-Vj 


m- 1 


and 

v.  _ . 

2>,*r 

Jk=l 

These  two  equations  cannot  be  solved  analytically,  but  approximate  solutions  can  be 
obtained  by  an  iterative  procedure.  The  FCM  uses  iterative  optimization  of  an  objective 
function  based  on  a  weighted  similarity  measure  between  data  points  and  cluster  centers. 


Step  1.  Input  the  number  of  clusters,  c,  the  weighting  exponent,  m,  and  the  error 
tolerance,  8. 

Step  2.  Input  the  data  X  =  {  Xi,x2,...,xn  }. 

Step  3.  Initialize  the  membership  values  U  =  [uik  ]. 

Step  4.  Calculate  the  new  cluster  centers  by  the  3rd  equation. 

Step  5.  Update  the  by  the  2nd  equation. 

Step  6.  Return  to  Step  3,  if  ||u(,+l)  -U(/)||  >  ey  otherwise  output  U... 

X:  [  sxn  ]  n:  #  of  data  elements  -  polygraph  test  sessions. 

U:  [  cxn  ]  s:  #  of  features  -  dimension  of  the  samples  in  each  cluster. 

V:  [  sxc  ]  c:  #  of  clusters 


Fig.2:  The  iterative  FCM10  procedure 


10See  Fig.3  ,  the  flow  chart  of  the  FCM  code  implemented  in  this  project. 
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Fig.3:  Flow  chart  of  the  FCM  code  implemented  in  this  project 


2. 2. 3. 2.  What  the  influential  parameters  practically,  mean  or  represent, 
andhgw  to  interpret  the  clustering  algorithm  itself) 

The  weighting  exponent  m  represents  the  "fuzziness"  level.  It  controls  the  extent  of 
membership  sharing  among  the  fuzzy  clusters.  Recall  the  example  of  the  two  clusters, 
"desks"  and  "chairs"  in  chapter3.1;  In  a  hard c-means  clustering  environment  (m->  1)  each 
object  can  either  belong  to  "chairs"  or  "desks",  i.e.  its  membership  value  is  either  one  or 
zero  for  each  cluster.  Now,  the  higher  m  is,  the  fuzzier  the  results  will  be.  Thus,  a  desk  - 
.  which  can  also  be  used  as  a  chair-  may  get  a  membership  value  higher  than  zero  for 
belongingness  to  the  chairs  cluster.  In  this  sense,  m  controls  the  membership  values  as 
following 

lim  uik  =  —. 
m-+oo  c 
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The  control  parameter  epsilon  represents  the  interrupt  criterion.  It  influences  the  number 
of  iterations  and  therefore  the  accuracy  of  the  algorithm  which  is  the  search  for  c  minima. 
By  making  epsilon  smaller  we  get  more  accurate  clustering  results,  but  also  more 
computing  time,  which  is  not  important  in  this  specific  case. 

The  algorithm  primarily  gives  us  after  each  iteration  new  cluster  centers  V{  and  new 
membership  values  Uik .  It  then  calculates  the  spatial  distances  between  each  data  element 
and  the  found  cluster  centers  then  checks  the  interrupt  criterion.  If  these  distances  are 
small  enough,  the  algorithm  will  eventually  give  us  the  best  membership  values  and  the 
appropriate  cluster  centers.  At  this  point,  the  search  for  an  internal  structure  within  the 
polygraph  data  -the  original  intention  of  every  clustering  process-  will  be  finished. 

FCM  algorithm  belongs  to  the  so-called  partitional  clustering  algorithms  which  generate  a 
fuzzy  c-patition  matrix  in  a  feature  space.  In  this  project  I  set  the  number  of  clusters  c,  as  a 
known  parameter,  equal  to  two.  It  can  otherwise  be  a  part  of  the  clustering  optimization 
itself.  This  decision  was  made  after  running  some  initial  tests  with  c  =  3  as  well,  which 
represents  "deceptive",  "truthful"  and  "ambiguous"  clusters. 


Fig.4:  Fuzzy  C-means  algorithm  applied  on  polygraph  data 


4-17 


2.2.4.  Whv  LMS  fuzzv  adaptive  filter? 


Filters  are  information  processors.  In  practice,  information11  usually  exists  in  two  different 
modes: 


•  Numerical  data  associated  with  the  problem, 

•  linguistic  descriptions  of  human  experts 
(often  in  the  form  of  fuzzy  IF-THEN  rules) 

Conventional  filters  can  only  process  numerical  data,  whereas  expert  systems  can  only 
make  use  of  linguistic  information,  i.e.  a  successful  pattern  recognition  system  in 
conventional  form  can  only  be  guaranteed  where  either  linguistic  rules  or  numerical  data 
do  not  play  a  critical  role.  Recall  the  fact  that  even  in  those  cases  we  decide  for  a 
numerical  method,  we  use  linguistic  information,  consciously  or  unconsciously,  in  the 
choice  among  different  filters,  the  evaluation  of  filter  performance,  the  choice  of  the  filter 
orders,  the  interpretation  of  filtering  results,  and  so  on. 

The  LMS12  fuzzy  adaptive  filter  is  a  new  kind  of  nonlinear  adaptive  filter  which  makes  use 
of  both  linguistic  and  numerical  information  concerning  the  physical  characteristics  of  the 
polygraph  data  in  their  natural  form.  This  filter  is  constructed  from  a  set  of  changeable 
fuzzy  IF-THEN  rules,  i.e.  we  have  the  choice  of  setting  the  rules  according  to  our 
experiences  and  incorporating  them  directly  into  the  filter,  or  initializing  the  rules 
arbitrarily;  similar  to  the  polynomial,  neural  nets,  or  radial  basis  function  adaptive  filters. 

2.2.5.  LMS  fuzzv  adaptive  filter  and  its  interpretation: 

2.2.5.  L  Filter  code  -  An  adaptive  procedure 

As  stated  before,  this  filter  is  constructed  from  a  set  of  changeable  fuzzy  IF-THEN  rules 
by  matching  input-output  pairs  through  an  adaptation  procedure.  The  adaptive  algorithm 
updates  the  parameters  of  the  membership  functions  which  characterize  the  fuzzy  concepts 
in  the  IF-THEN  rules  by  minimizing  a  criterion  function. 

Consider  a  real-valued  vector  sequence  [x(Jfc)]  and  a  real  valued  scalar  [tf(fc)].  The  adaptive 
filter  fk :  U  ->  R  is  to  determine,  such  that  L  =  E[(d(k)-fk(x(k)))2  ]  is  minimized. 


11  About  the  pattern  of  the  subject  to  be  studied. 
12LMS  =  Least  Mean  squares 
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With  k  =  1,2,3,...  and  x(k)  et/  =  [C," ,  C,+ ]  x [C' ,  C2+  ]  x  •  •  •  x  [C; ,  C„+  ]  c  7T .  U  and  R  are 
the  input  and  output  spaces  of  the  filter,  respectively. 

The  following  steps  describe  the  LMS  fuzzy  adaptive  filter13  used  in  this  project: 

Step  1:  M  fuzzy  sets  Ff  are  to  be  defined  in  each  interval  [Ct  ,Q+]  of  U  with  the 
following  Gaussian  membership  functions 


Mff(xi)  =  exp 


*i-xi 


rV 


where  /  =  1,2,..., M,  /  =  1,2, ...,/i,  x,  e[Q,C;+],  and  xj  and  a\  are  free  parameters  which 
will  be  updated  in  the  LMS  adaptation  procedure  of  Step  4. 


Step  2:  A  set  of  M  fuzzy  IF-THEN  rules  is  to  be  constructed  in  the  following  form: 

R IF x,  is F’  and  ...  xn  is  F*,  THEN d is  Gl, 

Rm :  IFx,  is  F“  and  ...  x„  is  F„M ,  THEN  d  is  GM. 

where  x  =  (x^.-.x^^  eU,  d  (=R,  F?' s  are  defined  in  Step  1,  and  G1' s  are  fuzzy  sets 
defined  in  R.  The  (parameters  of)  membership  functions  nF,  and  /uG,  in  these  rules  will 

change  during  the  LMS  adaptation  procedure  of  step  4.  Therefore,  the  rules  constructed 
in  this  step  are  initial  rules  of  the  fuzzy  adaptive  filter. 


Step  3:  The  filter  fk :  U ->  R  is  constructed  based  on  the  M  rules  of  the  Step  2  as 


follows: 


fM)  = 


M  ( 


where  u  ,'s  are  the  Gaussian  membership  functions  of  Step  1,  and  01  sR  is  any  point  at 
which  u,  achieves  its  maximum  value. 

*  G 

13This  algorithm  is  suggested  in  [Wangl993]  and  [Wangl9941. 
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Because  we  chose  the  membership  functions  to  be  Gaussian  functions  which  are  nonzero 
for  any  x,  e[C“,C(+],  the  denominator  of  the  last  equation  is  nonzero  for  any  xeU. 
Therefore,  the  filter  fk  is  well  defined,  and  because  the  O'  as  well  as  xj  and  a\  are  free 
parameters,  this  filter  is  nonlinear  in  the  parameters. 


Step  4:  The  following  LMS  algorithm  [Widrowl985]  is  used  to  update  the  filter 
parameters  O',  xj  and  a/.  With  the  initial  O'(o),  xj( 0)  and  or/(0)  values  determined  in 

Step  2,  the  adaptive  procedure  is  as  following: 


0'{k)  =  6'{k  - 1)  +  o[d{k)  - 

l(i\  ~l(t  .  fj(i\  f  1  If  i  Xi(k  _l) 

(A:)  =  X,  {k  - 1)  +  C{d{k )  -  fk\  Yi  °  ^  ^  (^/(Jfc-l))2 


b(k- 1) 

7,'{k)  =  <r‘{k - 1)  +  c{d{k)-fk]  -  ^ ^  —  a'{k - 1) 


(x#(^)-x/(Ar-l)) 

(cr/(*~l))3 


where  a1  (k-l)  =  nexp[-— (— 

<=i  2 


(k)-x!(k-l)  2 


],b(k-l)=Za'(k-l),fk 
/= 1 


M 

M _ 

b(k-l) 


and  a  is  a  small  positive  step-size.  These  equations  are  obtained  by  taking  the  gradient  of 
L  ignoring  the  expectation  is  (see  chapter  2.2.5. 1). 


2. 2. 5. 2.  Influential  parameters  -  meanings  &  interpretations: 

The  LMS  algorithm  is  a  gradient  algorithm,  i.e.  a  good  choice  of  initial  parameters  O' ,  x' 
and  a /  is  very  important  to  its  convergence  concerning  accuracy  and  time.  Since  the  error 
measure  of  this  "back-propagation"  algorithm  is  an  extremely  complicated  function  of  all 
the  parameters  0l,  xj  and  o\,  it  can  have  numerous  local  minima.  Depending  on  the 
initial  parameter  estimates,  this  algorithm  always  leads  to  the  nearest  minimum,  i.e.  it  can 
become  stuck  in  a  local  minimum  of  the  error  measure. 


Recall  that  this  filter  is  constructed  based  on  linguistic  rules  from  our  previous  experiences 
and  some  arbitrary  rules.  Both  sets  of  rules  are  updated  during  the  LMS  adaptation 
procedure  of  Step  4  by  changing  the  parameters  in  the  direction  of  minimizing  L . 
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In  other  words,  the  adaptation  procedure  can  be  directed  to  the  local  minimum  we  want 
(i.e.  accuracy  factor)  and  can  converge  quickly  (i.e.  time  factor). 

if  these  rules  provide  good  instructions  for  how  the  filter  should  perform,  that  is,  good 
description  of  the  input-output  pairs  [*(#);  ^(fc)]. 

The  updating  parameters  6>'[Mxl],  x'[MxN]  andcr/[MxN]  represent  output  means,  input 
means  and  the  input  width  of  the  Gaussian  distributed  data,  respectively.  The  scalar  output 
d  is  basically  the  label14  of  the  test  data[lxN]  in  numerical  form,  and  o\  describes  how  far 
the  data  from  the  output  mean  can  be  and  still  be  assigned  to  it  in  an  appropriate  fuzzy 
form.  M  represents  the  number  of  the  rules  and  N  the  number  of  the  features,  i.e.  the 
dimension  of  the  data.  The  parameter  a  is  the  "learning  factor"  or  the  step-size  of  training. 
It  represents  how  fast  and  how  smooth  the  training  process  proceeds. 


Fig.5:  The  LMS  fuzzy  adaptive  filter  used  in  this  project 


14"deceptive"  or  "non-decptive". 
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3.1.  Part  I  -  FCM 


§3.  APPROACH 


3.1.2.  Initial  stage  (conditions  and  methods): 

A  primary  component  of  every  pattern  recognition  problem  is  feature  extraction.  And  this 
is  actually  one  of  the  most  important  and  influential  tasks  for  any  successful  approach. 

In  previous  researches  [Layeghi 1993,1]  [Jacobsl993]  [Dastmalchil993],  students  have 
already  investigated  a  set  of  669  features  for  each  polygraph  test  session.  They  used  these 
features  to  train,  optimize  and  eventually  classify  the  data  by  a  fuzzy  K-Nearest  Neighbor 
algorithm  (KNN). 

In  this  project,  I  have  used  these  same  features  in  their  original  form.  I  have  also  selected 
their  best  features  and  feature  combinations  for  initial  tests  of  my  algorithm  and  for 
comparison  between  fuzzy-CM,  fuzzy  LMS  adaptive  filter  and  the  fuzzy  KNN  approach. 
At  this  point,  the  question  of  consistency  and  transferability  of  the  features  -  independent 
of  the  algorithm  -  became  more  significant.  It  turned  out  to  be  one  part  of  this  research15. 


Fig.6:  An  example  for  a  set  of  polygraph  data  as  a  matrix 
and  its  features  used  in  this  study 

As  mentioned  earlier,  each  feature  (total  number=960)  is  extracted  for  all  polygraph  test 
questions ,  that  is  for  relevant,  irrelevant  and  control  questions.  It  was,  however,  decided 

15See  also  chapter  4. 1.2.3. 
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not  to  use  irrelevant  questions  in  this  study,  because  in  a  Controlled  Question  Polygraph 
Test  comparison  between  the  responses  to  relevant  and  control  questions  is  the  actual  and 
most  important  factor. 


Fig.7:  The  original  feature  combinations 


The  Total  number  of  the  features  for  every  test  session  at  this  stage  is  669.  Each  set 
contains  the  same  non-deceptive  files  but  different  deceptive  ones.  For  more  specific 
details  about  how  the  feature  extraction  was  processed,  and  about  combination  methods 
which  narrowed  the  total  number  from  960  to  669,  see  the  references  mentioned  above. 


3.1.3.  Clustering  stage 


3.  L  3.  L  One-dimensional  search  and_  selection  of_  the  [ 'best "  single  features 

After  implementation  and  initial  tests  of  the  FCM-code,  I  began  with  the  one-dimensional 
clustering  (using  one  feature  for  all  sessions).  I  used  three  sets  (polydat_l,  polydat_2, 
polydat_3)  of  such  structured  data  as  shown  in  Fig.42  containing  100  data  elements,  i.e. 
50  truthful  and  50  deceptive  files.  With  these  data,  we  ran  669  one-dimensional  clustering 
searches  containing  100  different  one-dimensional  data  points  at  each  time.  As  a  result,  we 
attained  669  times  2  clusters  for  each  polydatj. 
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After  running  these  tests  and  evaluating  them,  I  decided  to  select  four  sets  of  "best"  one¬ 
dimensional  features  out  of  each  polydat_i  in  preparation  for  the  multi-dimensional 
clustering  search.  This  decision  was  necessary  to  narrow  the  number  of  features,  since  it  is 
impractical  to  find  the  best  combination  (concerning  the  quantity  and  the  quality)16  out  of 
this  immense  number  of  features  by  an  exhaustive  way  of  searching. 


For  example,  chosing  only  4  or  less  feature-tuples  from  a  set  of  669  by  trying  all  the 
possible  different  combinations  needs  the  following  number  of  computations: 


4  ( 669^ 

Lll  1  J 


.  T  6691  -10»0. 

i  =  1i!(669-i)! 


The  other  challenge  while  finding  good  feature  combinations  is  the  problem  of  single 
features  which  yield  poor  results  by  one-dimensional  clustering,  but  when  used  in 
combination  with  other  features  yield  very  good17  results. 

To  narrow  the  amount  of  possible  features,  I  decided  to  select  the  following  four  sets  of 
single  features  with  different  performances. 


percentage  of  right  detections  in 


deceptive  files 

non-deceptive  files 

group  1 

>  60% 

& 

>  60% 

group  2 

>  80% 

& 

>  50% 

group  3 

>  50% 

& 

£  80% 

group  4a 

>  98% 

& 

no  constraints 

group  4b 

no  constraints 

& 

>  98% 

Fig.8:  Selected  features  by  using  one-dimensional  FCM 


The  threshold  of  60%  was  chosen,  because  any  other  value  below  or  above  that  limit 
would  again  give  us  either  too  many  or  not  enough  features.  Furthermore,  any  other  value 

16That  means:  How  many  features  and  which  ones  should  be  taken  in  a  combination. 

17"Good"  or  "poor"  in  sense  of  the  definition  in  chapter  1.1.2. 
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closer  to  the  limit  50%  for  both  deceptive  and  non-deceptive  files  would  be  only  a  random 
clustering  process.  Yet,  this  decision  was  not  enough.  We  would  have  lost  some  good 
features  which  provide  correct  detections  -  better  than  80%  -  for  at  least  one  of  the  files. 
The  fourth  group  was  chosen  to  enable  us  to  consider  some  extreme  cases. 

As  an  additional  set  of  one-dimensional  features,  I  chose  those  with  good  results  in  multi¬ 
dimensional  tests18  for  one  of  the  polydat_i's,  and  used  them  also  for  the  other  two 
polydat_i's,  even  though  they  didn't  belong  to  one  of  the  four  feature  sets  mentioned 
above.  This  set  was  important  to  fulfill  the  constraint  of  consistency  and  transferability  for 
any  chosen  polygraph  data19. 


18See  chapter  3. 1.3.2. 

19See  the  comparison  in  chapter  4. 1.2.3. 
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1.0000 

2.0000 

3.0000 

4.0000 

5.0000 

6.0000 

7.0000 

8.0000 

9.0000 

10.0000 

11.0000 

12.0000 

13.0000 

14.0000 

15.0000 

16.0000 

17.0000 

18.0000 

19.0000 

20.0000 

21.0000 

22.0000 

23.0000 

24.0000 

25.0000 

26.0000 

27.0000 

28.0000 

29.0000 

30.0000 


w-dcp 

# 

12.0000 

37.0000 

16.0000 

12.0000 

15.0000 

38.0000 

48.0000 

22.0000 

22.0000 

22.0000 

0 

20.0000 

46.0000 

22.0000 

12.0000 

37.0000 

16.0000 

12.0000 

15.0000 

38.0000 

48.0000 

12.0000 

10.0000 

21.0000 

18.0000 

24.0000 

12.0000 

46.0000 

18.0000 

12.0000 


dcp-ok 

% 

76.0000 

26.0000 

68.0000 

76.0000 

70.0000 

24.0000 

4.0000 

56.0000 

56.0000 

56.0000 

100.000 

60.0000 

8.0000 

56.0000 

76.0000 

26.0000 

68.0000 

76.0000 

70.0000 

24.0000 

4.0000 

76.0000 

80.0000 

58.0000 

64.0000 

52.0000 

76.0000 

8.0000 

64.0000 

76.0000 


w-non 

# 

9.0000 

44.0000 

10.0000 

18.0000 

16.0000 

27.0000 

0 

9.0000 

8.0000 

11.0000 

33.0000 

15.0000 

26.0000 

11.0000 

9.0000 

44.0000 

10.0000 

17.0000 

16.0000 

27.0000 

0 

14.0000 

45.0000 

15.0000 

24.0000 

19.0000 

23.0000 

2.0000 

9.0000 

10.0000 


non-ok 

% 

82.0000 

12.0000 

80.0000 

64.0000 

68.0000 

46.0000 

100.000 

82.0000 

84.0000 

78.0000 

34.0000 

70.0000 

48.0000 

78.0000 

82.0000 

12.0000 

80.0000 

66.0000 

68.0000 

46.0000 

100.000 

72.0000 

10.0000 

70.0000 

52.0000 

62.0000 

54.0000 

96.0000 

82.0000 

80.0000 


iter_# 

13.0000 

15.0000 

14.0000 

15.0000 

16.0000 

15.0000 

40.0000 

8.0000 

13.0000 

38.0000 

26.0000 

6.0000 

10.0000 

16.0000 

27.0000 

17.0000 

25.0000 

37.0000 

40.0000 

34.0000 

31.0000 

25.0000 

20.0000 

23.0000 

29.0000 

18.0000 

22.0000 

35.0000 

28.0000 

14.0000 


£=669 


447.0000 

448.0000 

449.0000 

450.0000 

451.0000 

452.0000 

453.0000 


17.0000 

7.0000 

16.0000 

12.0000 

13.0000 

5.0000 

18.0000 


66.0000 

86.0000 

68.0000 

76.0000 

74.0000 

90.0000 

64.0000 


36.0000 

40.0000 

11.0000 

9.0000 

18.0000 

20.0000 

18.0000 


28.0000 

20.0000 

78.0000 

82.0000 

64.0000 

60.0000 

64.0000 


17.0000 

25.0000 

15.0000 

15.0000 

20.0000 

13.0000 

12.0000 


662.0000 

663.0000 

664.0000 

665.0000 

666.0000 

667.0000 

668.0000 

669.0000 


27.0000 

16.0000 

21.0000 

31.0000 

34.0000 

25.0000 

15.0000 

15.0000 


46.0000 

68.0000 

58.0000 

38.0000 

32.0000 

50.0000 

70.0000 

70.0000 


34.0000 

30.0000 

37.0000 

23.0000 

17.0000 

28.0000 

37.0000 

39.0000 


32.0000 

40.0000 

26.0000 

54.0000 

66.0000 

44.0000 

26.0000 

22.0000 


9.0000 

9.0000 

17.0000 

14.0000 

45.0000 

20.0000 

12.0000 

11.0000 


Feature  number:  ft_# 

#  of  wrong  results  in  decept.  data:  w-dcp 
%  right  detection  in  decept.  data:  dcp-ok 

#  of  wrong  results  in  truthful  data:  w-non 
%  right  detection  in  truthful  data:  non-ok 

Iterations  #  for  each  feature:  iter  # 


Fig.9:  An  example  for  one-dimensional  clustering 
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ft  #  w-dcp 

dcp-ok 

w-non 

non-ok 

iter_# 

£=45 

# 

% 

# 

% 

1.0000  12.0000 

76.0000 

9.0000 

82.0000 

13.0000 

3.0000  16.0000 

68.0000 

10.0000 

80.0000 

14.0000 

4.0000  12.0000 

76.0000 

18.0000 

64.0000 

15.0000 

5.0000  15.0000 

70.0000 

16.0000 

68.0000 

16.0000 

12.0000  20.0000 

60.0000 

15.0000 

70.0000 

6.0000 

15.0000  12.0000 

76.0000 

9.0000 

82.0000 

27.0000 

17.0000  16.0000 

68.0000 

10.0000 

80.0000 

25.0000 

18.0000  12.0000 

76.0000 

17.0000 

66.0000 

37.0000 

19.0000  15.0000 

70.0000 

16.0000 

68.0000 

40.0000 

22.0000  12.0000 

76.0000 

14.0000 

72.0000 

25.0000 

29.0000  18.0000 

64.0000 

9.0000 

82.0000 

28.0000 

30.0000  12.0000 

76.0000 

10.0000 

80.0000 

14.0000 

31.0000  14.0000 

72.0000 

16.0000 

68.0000 

21.0000 

33.0000  18.0000 

64.0000 

16.0000 

68.0000 

14.0000 

36.0000  15.0000 

70.0000 

8.0000 

84.0000 

14.0000 

37.0000  8.0000 

84.0000 

13.0000 

74.0000 

15.0000 

38.0000  12.0000 

76.0000 

14.0000 

72.0000 

18.0000 

39.0000  14.0000 

72.0000 

13.0000 

74.0000 

17.0000 

40.0000  16.0000 

68.0000 

15.0000 

70.0000 

13.0000 

50.0000  17.0000 

66.0000 

17.0000 

66.0000 

18.0000 

52.0000  15.0000 

70.0000 

20.0000 

60.0000 

23.0000 

68.0000  13.0000 

74.0000 

18.0000 

64.0000 

17.0000 

70.0000  20.0000 

60.0000 

20.0000 

60.0000 

23.0000 

82.0000  16.0000 

68.0000 

20.0000 

60.0000 

12.0000 

141.0000  17.0000 

66.0000 

17.0000 

66.0000 

15.0000 

155.0000  17.0000 

66.0000 

17.0000 

66.0000 

25.0000 

176.0000  16.0000 

68.0000 

18.0000 

64.0000 

13.0000 

177.0000  16.0000 

68.0000 

16.0000 

68.0000 

13.0000 

197.0000  13.0000 

74.0000 

17.0000 

66.0000 

15.0000 

200.0000  17.0000 

66.0000 

13.0000 

74.0000 

12.0000 

211.0000  13.0000 

74.0000 

16.0000 

68.0000 

42.0000 

214.0000  17.0000 

66.0000 

12.0000 

76.0000 

27.0000 

216.0000  15.0000 

70.0000 

14.0000 

72.0000 

32.0000 

235.0000  15.0000 

70.0000 

19.0000 

62.0000 

14.0000 

395.0000  18.0000 

64.0000 

17.0000 

66.0000 

10.0000 

449.0000  16.0000 

68.0000 

11.0000 

78.0000 

15.0000 

450.0000  12.0000 

76.0000 

9.0000 

82.0000 

15.0000 

451.0000  13.0000 

74.0000 

18.0000 

64.0000 

20.0000 

452.0000  5.0000 

90.0000 

20.0000 

60.0000 

13.0000 

453.0000  18.0000 

64.0000 

18.0000 

64.0000 

12.0000 

Feature  number:  ft_# 

458.0000  16.0000 

68.0000 

14.0000 

72.0000 

8.0000 

#  of  wrong  results  in  decept.  data:  w-dcp 

459.0000  20.0000 

60.0000 

10.0000 

80.0000 

10.0000 

%  right  detection  in  decept.  data:  dcp-ok 

460.0000  14.0000 

72.0000 

18.0000 

64.0000 

9.0000 

#  of  wrong  results  in  truthful  data:  w-non 

462.0000  14.0000 

72.0000 

17.0000 

66.0000 

7.0000 

%  right  detection  in  truthful  data:  non-ok 

600.0000  18.0000 

64.0000 

20.0000 

60.0000 

37.0000 

Iterations_#  for  each  feature:  iter_# 

Fig.10:  An  exmple  for  the  first  group  of  selected  features 
( representing  group  #1  at  page ) 
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3. 1.3.2.  Multi-dimensional  search  (or  the  best  feature  combination : 


3.  L  3. 2.  L  Overview 


Having  obtained  these  four  sets  of  features,  a  multi-dimensional  searching  process  through 
all  of  them  was  initiated  to  find  the  best  feature  combinations  (concerning  the  quantity  and 
the  quality20). 

Even  though  the  number  of  the  features21  has  already  been  narrowed,  it  is  still  impractical 
to  do  an  exhaustive  search,  since  the  total  number  of  the  features  contained  in  these  four 
sets  is  about  100  for  each  polydat_i.  In  other  words,  the  following  number  of 
computations  is  still  needed  for  calculation  of  all  4  or  less  possible  feature-tuples: 


4  A0(A  4  100! 

i  =  ll  '  J"i?i«(100-i)l 


«4.0-106 . 


At  this  stage,  I  decided  to  investigate  3  different  search  methods  to  bypass  the  exhaustive 
way.  They  are 

1 .  random  search  without  duplication  of  any  feature  within  a  tuple, 

2.  pseudo-exhaustive  search  with  the  option  of  duplication  and  finally 

3.  genetic  search  with  "uncontrollable”  possibility  of  duplications. 

In  previous  research  projects  [Layeghi  1993,1]  [Dastmalchil993]  [Jacobsl993],  it  was 
decided  to  narrow  the  feature  numbers  from  669  to  30  "best"  ones  and  then  an  exhaustive 
search  was  run  for  up  to  four-  or  five-tuple  combinations.  In  other  words,  their  strategy 
was  completely  different  than  the  aforementioned  three  strategies. 

As  mentioned  before  a  "poor"  or  an  average  single  feature  by  one-dimensional  clustering 
might  give  us  in  combination  with  other  features  very  good  or  even  better  results  by  a 
multi-dimensional  clustering  than  any  of  them  individually. 

This  fact  was  totally  neglected  by  the  feature  selection  methods  used  in  the  previous 
researches22  [Laueghil993,l]  [Dastmalchil993], 

20That  means:  How  many  features  and  which  ones  should  be  taken  in  a  combination. 

21See  chapter  3.1.3. 1. 

22See  chapter  4.3.  comparison  for  more  details  about  differences  between  this  and  previous  works. 
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Applying  these  three  new  strategies,  I  was  able  to  consider  more  possible  features  for  a 
multi-dimensional  clustering  than  in  previous  works,  without  using  the  impractical 
exhaustive  method. 


Fig.ll:  General  search  to  find  the  best  feature  combination 


3. L3. 2. 2.  Random  search  method 


Applying  this  method,  an  average  of  14  to  20  different  features  out  of  the  aforementioned 
four  sets  were  taken,  and  then  the  FCM  algorithm  including  the  evaluation  program  for 
randomly  chosen  4-tuples  were  run.  After  about  1000  combinations  were  constructed,  I 
then  picked  out  the  best  features  and  their  combinations,  and  replaced  the  poor  ones  with 
new  features.  This  same  procedure  was  repeated  until  good23  combinations  were  found. 

23"Good"  in  sense  of  the  definition  in  chapter  1.1.2. 


4-29 


Every  time  the  results  were  out  of  balance  -  i.e.  highly  better  detection  either  for  deceptive 
or  non-deceptive  files  by  the  cost  of  the  other  one  -  I  appropriately  took  additional 
features  from  those  four  sets  to  eliminate  the  difference  by  improving  the  results  of  the 
worse  file  -  and  as  much  as  possible  -  by  maintaining  the  results  of  the  better  file. 

After  running  this  kind  of  tests  several  times,  we  were  able  to  estimate  which  features  are 
the  good  ones  to  combine  together. 


3. L 3. 2. 3.  Pseudo-exhaustive  search  method 

Having  some  idea24  which  features  are  good  in  a  combination  with  others25, 1  built  every 
possible  four-  to  six-tuples  out  of  those  features  and  evaluated  them.  This  method  was 
very  important  to  make  sure  that  we  did  not  lose  any  good  combinations  which  might 
have  been  neglected  by  the  random  search. 

I  called  this  method  "pseudo' "-exhaustive,  because  each  time  it  considers  only  a  small  part 
of  the  available  features;  but  "exhaustive",  because  it  takes  all  the  possible  combinations 
within  this  part.  Except  for  this  major  difference,  all  the  other  steps  of  this  method  are 
exactly  the  same  as  the  random  search. 


3.1. 3.2.4.  Genetic  search  method: 


This  algorithm  is  basically  a  compromise  between  the  pseudo-exhaustive  and  the  random 
search  method,  plus  a  weighting  system  which  supports  those  features  with  good  results. 

Initial  populations  of  200  to  300  chromosomes26  are  randomly  created.  Each  chromosome 
is  a  combination  of  N  features,  where  N  stays  constant  for  each  population  during  the 
outgrowth.  Each  single  feature  is  selected  from  a  gene  pool  for  the  particular  population 
that  the  individual  belongs  to.  Each  gene  pool  consists  of  twenty  to  forty  features  that  we 
have  chosen27. 

24By  using  the  results  of  the  random  search  method  and  also  the  5th  group  mentioned  at  page  3.1.3. 1. 
25Remember  the  fact  that  some  "poor”  single  features  might  give  us  in  combination  with  others  very 
good  results 

^Individuals  or  feature-tuples. 

27Directed  by  our  experience  from  using  the  random  and  the  pseudo-exhaustive  methods. 
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In  this  project  three  processes  operate  on  the  evolution28  of  each  population: 

•  reproduction 

•  crossover 

•  mutation. 

These  three  processes  determine  how  each  new  generation  will  be  created  based  on  the 
old  one.  Before  genetic  reproduction,  the  fiizzy-c-means  algorithm  evaluates  the 
percentage  of  correct  deceptive  and  non-deceptive  detections  for  each  chromosome.  The 
average  of  them  is  the  fitness  value  of  that  chromosome.  During  the  genetic  reproduction, 
the  chromosomes  of  the  new  generation  are  copied  from  the  chromosomes  of  the  old 
generation  in  a  probabilistic  sense.  The  probability  that  a  particular  chromosome  will  be 
copied  is  the  ratio  of  that  chromosome's  fitness  value  against  the  total  fitness  values  of  the 
entire  population  of  the  old  generation. 

After  selection,  genetic  crossover  randomly  chooses  pairs  of  chromosomes  as  parents, 
splices  them,  and  recombines  them  -  by  randomly  mixing  some  of  the  parents  genes  -  into 
pairs  of  offsprings.  Finally,  genetic  mutation  randomly  substitutes  a  new  gene  within  a 
randomly  chosen  chromosome.  The  extent  to  which  crossover  and  mutation  occur  can  be 
verified  by  appropriate  initialization. 


Fig.12:  An  example  for  the  genetic  outgrowth  with 
4  genes  (=features)  in  each  chromosome  (individual) 


28See  chapter  4. 1.2.2  for  particular  results  of  this  method. 
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3, 7.3.3.  General  process  -  Optimization  bv  changing  parameters: 


Simultaneously  to  the  search  for  the  best  features  and  their  combinations,  we  were 
optimizing  the  system  by  changing  and  adjusting  the  parameters.  Recall,  the  whole  idea  of 
this  pattern  recognition  was  to  cluster  the  unlabeled  data  into  two  clusters  which  represent 
the  deceptive  and  the  truthful  group29. 

Knowing  the  information  of  which  files  were  deceptive  or  truthful30,  we  were  able  to 
change  the  parameters  in  the  way  that  the  output  could  continuously  come  closer  to  the 
real  cluster  structure.  This  process  is  depicted  in  the  following  figure.  The  "fuzzy  c-means 
algorithm"  block  not  only  represents  the  pure  FCM  algorithm  shown  in  Fig.3,  but  also  the 
general  search  for  good  features  shown  in  Fig.  11  which  ran  simultaneously  with  the 
optimization  process. 


Fig.  13:  Optimization  of  the  clustering  environment 
-  General  process  - 


As  an  example,  I  will  briefly  discuss  how  the  parameter  m  was  chosen  and  eventually 
modified:  The  weighting  exponent  m  plays  a  significant  role  in  this  system.  Since  the 
control  parameter  m  itself  does  not  belong  to  the  optimizing  values  within  the  iterative 
process  of  FCM  algorithm,  one  must  choose  m  before  implementing  the  algorithm,  and 


29See  chapter  3.1.2. 

30We  know  this  information  beforehand  for  sure,  because  the  subjects  have  confessed  their  case  or  the 
actual  offender  was  found. 
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optimize  it  manually.  There  are  several  research  papers  written  as  an  attempt  to  find  the 
optimal  m  for  different  clustering  problems. 

The  effect  of  m  was  discussed  in  [Bezdekl981].  Although  Bezdek  proposed  heuristic 
guidelines  for  m,  no  theoretical  basis  for  an  optimal  choice  for  tn  has  been  reported.  The 
only  known  paper  in  this  matter  [Choel992]  proposed  a  method  for  determining  m  based 
on  the  concept  of  fuzzy  decision  theory  initiated  by  [Zadehl970], 

But  since  the  definition  of  "good"  clusters  in  [Choel992]  did  not  exactly  match  to  our 
clustering  environment,  I  chose  the  "trail  and  error"  strategy  to  find  the  optimal  m  by 
systematically  increasing  it.  Fortunately,  there  is  a  logical  limit31  for  this  increasing  process 
in  our  case,  even  though  m  can  mathematically  be  any  value  from  [  2,  oo  ). 


". "  represents  the  mmebership  values  for  m= 2 
"+  ”  represents  the  membership,  values^  for  m  =5 


Fig.14:  An  example  for  the  influence  of ’m’ 


31See  chapter  2.2.3.2.  for  the  meaning  of  m. 
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For  more  details  on  this  matter  see  the  chapter  4.1.1.  In  Fig.  14,  you  see  an  example  for 
how  the  weighting  exponent  m  influences  the  membership  values  for  one  of  the  features 
from  polydat_3  in  one-dimensional  mode. 


3.  L  3. 4.  Evaluation  strategy 


Due  to  the  small  number  of  non-deceptive  cases  available,  each  session  for  a  subject  was 
used  as  a  separate  and  individual  case.  But  in  average,  each  group  of  three  sessions  belong 
to  one  person  concerning  the  same  crime,  meaning  the  results  of  these  sessions  are  not 
independent  of  each  other.  Using  this  additional  information,  the  clustering  system  can 
come  closer  to  the  actual  structure  of  the  data,  i.e.  we  can  get  a  better  performance. 


session  #1 


non-deceptive 


polygraph  examination  for  a  deceptive  subject 


fc]  :  Control  question 
[r|  :  Relevant  question 


session  #2 


deceptive 


session  #3 


© 

(D 

© 

© 

© 


FCM 


deceptive 


deceptive 


Fig.  15:  An  example  for  the  final  evaluation  using  the 
dependency  of  the  sessions 
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After  clustering  and  evaluating32  each  session  separately,  some  cases  with  different 
responses  to  the  algorithm  were  found,  although  they  belonged  to  one  person.  In 
circumstances  like  this,  we  combined  the  individual  results  within  each  group  in  a  way  that 
the  majority  response  was  assigned  to  the  whole  group  (see  Fig.  15). 

In  those  cases  that  each  polygraph  examination  contains  2  or  4  test  sessions  where  there  is 
no  majority  response  to  build,  I  decided  to  take  only  those  membership  values  further  to 
the  threshold  0.5.  For  example,  by  the  feature  combination  [30,  30,  39,  235,  363,  450] 
used  to  cluster  polydat_l,  we  obtained  for  one  of  the  examination  with  four  sessions  the 
following  membership  values:  0.4164,  0.5519,  0.5377,  0.4780.  After  defuzzification  we 
got  0,  1,  1,  0  where  no  majority  class  can  be  build.  However,  the  second  and  the  third 
membership  values  are  closer  to  the  threshold  than  the  other  two  ones.  With  the 
aforementioned  strategy,  this  examination  is  labeled  with  0. 

Recall  that  each  polygraph  examination  has  a  set  of  control  and  relevant  questions  which 
is  repeated  an  average  of  three  times.  The  only  difference  between  each  session  is  the 
order  in  which  the  questions  are  asked. 


32The  general  evaluation  process  is  contracted  as  following: 

After  each  clustering  procedure  ( one-  or  multi -dimensional )  a  two-row  vector  of  membership  values  is 
given  which  represent  the  two  deceptive  and  non-deceptive  clusters.  The  evaluation  process  takes  the 
membership  values  of  one  these  clusters  and  counts  the  values  below  and  above  the  threshold  0.5.  Thus,  as 
a  result  we  get  the  absolute  number  of  wrong  and  right  detections. 
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3.2.  Part  II  -  LMS  fuzzv  adaptive  filter 


3.2.7.  Feature  selection  bv  visual  inspection: 

One  advantage  of  a  fuzzy  logic  system  is  its  use  of  common  sense  human  reasoning  as 
inference  rules.  The  fuzzy  LMS  algorithm  we  used  extends  this  advantage  by  further 
optimizing  such  inference  rules  to  "fit”  a  given  set  of  data.  To  fully  utilize  the  advantages 
of  this  fuzzy  LMS  algorithm,  we  had  to  face  two  issues:  coming  up  with  the  proper 
intuitive  rules  for  initialization  and  a  set  of  data  that  reflects  real-world  examples  for 
training. 

As  mentioned  before,  for  practical  reasons,  the  polygraph  recognizer  can  use  only  a  subset 
of  the  given  669  features,  and  we  would  have  to  choose  the  effective  ones.  Furthermore, 
the  fuzzy  logic  system  needed  reasoning  rules,  operating  on  those  features  we  selected,  to 
analyze  the  data.  We  believed  that  we  could  visually  inspect  graphical  plots  of  the  feature 
data  to  learn  about  the  feature  information.  Since  fuzzy  logic  corresponds  closely  with 
human  reasoning,  we  would  then,  based  on  the  knowledge  obtained  from  our  visual 
inspection,  select  features  that  help  differentiate  deceptive  and  non-deceptive  subjects  and 
codify  the  patterns  we  would  find  into  reasoning  rules. 

For  the  visual  inspection,  a  scatter  plot  was  made  of  the  data  in  polydat_3  of  each  single 
feature.  We  looked  at  each  plot  individually.  In  any  given  plot,  if  the  deceptive  and  non- 
deceptive  subjects  showed  distinctive  clusters,  then  the  feature  was  considered  good.  If 
the  elements  of  these  two  classes  seemed  to  be  randomly  located,  then  the  feature  was 
considered  bad.  After  viewing  all  669  plots,  we  subjectively  determined  the  following 
features33  to  be  very  good:  9, 11,  29,  164,  399, 449, 450,  451,  452,  and  454;  with  451  and 
452  to  be  the  best. 

Initially  the  fuzzy  adaptive  filter  was  to  be  designed  based  on  two  features,  with  more 
features  to  be  added  in  the  future  as  the  project  progresses.  We  limited  the  feature  couple 
to  be  composed  of  good  features  from  the  above  list.  Visual  inspection  was  made  of  the 
scatter  plots  of  the  data  in  polydat_3  of  various  such  feature  combinations  to  determine 
the  effective  ones.  While  selecting  feature  couples,  we  again  searched  for  combinations 
that  show  distinctive  clusters  for  deceptive  and  non-deceptive  subjects.  The  features 


33See  Fig.41  for  the  meaning  of  these  numbers. 
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within  a  combination  should  also  be  uncorrelated  with  each  other.  A  plot  of  the  feature 
449  and  450  combination  shows  that  they  are  a  bad  couple  because  they  seem  to  be 
linearly  correlated34,  as  the  data  points  fall  closely  along  a  straight  line. 


non-deceptive  files 
'o':  deceptive  files 
_ polydat_3 


Fig.  16:  Scatter  plots  of  two  linearly  correlated  features 

Visual  inspection  of  feature  couples  consumed  much  more  time  than  visual  inspection  of 
individual  features,  as  the  clusters  took  on  more  complicated  shapes.  Furthermore,  in  the 
fuzzy  LMS  algorithm  each  inference  rule  exerts  influence  centered  in  an  elliptical  contour 
where  the  major  and  minor  axes  are  parallel  with  the  axes  of  the  feature  plot.  Clusters  with 
a  complicated  shape  must  be  built  from  those  elliptical  regions  (see  next  figure).  Therefore 
we  had  the  additional  task  of  finding  clusters  in  the  feature  plots  that  could  be  easily 
approximated  with  few  ellipses,  to  reduce  system  complexity. 

Due  to  the  lack  of  time,  we  did  not  examine  the  plots  of  all  forty-five  possible 
combinations  of  the  ten  very  good  features  listed  above.  We  only  examined  a  random  few. 
Based  on  the  ones  we  did  examine,  we  settled  on  the  combination  of  features  451  and  452 
because: 


34Correlation  between  two  features  means  that  information  in  one  is  similar  to  the  information  in  the 
other  one,  and  using  them  together  only  introduces  redundancy  and  hardly  improves  the  system. 
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feature  #  452 


•  they  were  the  best  -  visually  recognizable  -  features  individually, 

•  they  seemed  uncorrelated  with  each  other  and 

•  we  roughly  found  four  elliptical  clusters  from  the  plot. 


feature  #  451 

non-deceptive  files 
'o':  deceptive  files 
_ Polydat-S 


Fig.17:  The  four  elliptical  clusters  used  for  setting  the  linguistic  rules 


3.2.2.  Setting  linguistic  rules: 


We  initialized  the  fuzzy  system  such  that  it  would  exploit  the  knowledge  we  had  just 
obtained  about  the  clusters  for  features  451  and  452.  There  were  two  inputs,  one  for  each 
feature,  and  four  rules,  one  for  each  cluster.  We  had  to  represent  those  visual  clusters  we 
found  with  inference  rules.  The  linguistic  rules  are  shown  in  the  following  figure. 


1.  IF  fl  is  about  -1  (±0.5)  and  f2  is  about  -0.5  (±0.8), 

THEN  decision  is  non-deceptive  =>  output  is  +1. 

2.  IF  fl  is  about  0  (±0.5)  and  f2  is  about  -0.25  (±0.25), 

THEN  decision  is  non-deceptive  =>  output  is  +1 . 

3.  IF  fl  is  about  0  (±0.1)  and  f2  is  about  0  (±0.2), 

THEN  decision  is  deceptive  =>  output  is  -1. 

4.  IF  fl  is  about  1  (±0.6)  and  f2  is  about  0.3  (±0.5), 

THEN  decision  is  deceptive  =>  output  is  -1. 

fl:  measurement  of  feature  #  451 
f2:  measurement  of  feature  #  452 


Fig.18:  Initial  linguistic  rules  for  the  fuzzy  adaptive  filter 
based  on  the  clusters  in  Fig.17 

The  linguistic  rules  above  were  then  translated  to  fuzzy  membership  functions  as  outlined 
in  [Wangl994],  The  xi's  were  the  centers  of  the  clusters;  the  sigmas  were  the  widths  of  the 
clusters  (±xxx  in  the  above  rules);  and  the  thetas  were  either  +1  or  -1  for  non-deception 
and  deception,  respectively. 

The  output  of  the  fuzzy  reasoning  based  on  the  above  four  rules  would  not  be  exactly  +1 
or  -1.  It  would  be  within  the  range  limited35  by  +1  and  -1.  For  our  project,  we  decided 
that  a  positive  output  denotes  non-deception  and  a  negative  output  denotes  deception.  In 
other  words,  the  decision  threshold  was  at  zero. 

35After  training  the  output  may  go  beyond  that  range. 
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For  future  investigations  one  may  experiment  with  a  different  threshold36. 

The  choice  of  plus  and  minus  one  for  non-deception  and  deception  is  based  on  the 
following  argument:  The  learning  technique  uses  the  squared  error,  which  is  the  square  of 
the  difference  between  the  desired  output  and  actual  output.  In  computing  that  squared 
error,  if  the  difference  between  the  desired  output  and  actual  output  is  greater  than  one, 
then  the  squaring  operation  expands  the  error  value  and  therefore  gives  more  significance 
to  such  mistakes.  On  the  other  hand,  if  the  difference  is  less  than  one,  than  the  squaring 
operation  compresses  the  error  value  and  therefore  gives  it  less  significance. 

Given  zero  as  the  threshold  between  deception  and  non-deception  and  assuming  the  actual 
output  would  never  go  beyond  plus  two  or  minus  two,  then  the  choice  of  plus  and  minus 
one  as  desired  outputs  would  mean  that  the  error  calculation  gives  more  significance  to 
misclassifications  and  less  to  correct  classifications;  Here  classification  refers  to  the  crisp, 
defuzzified  classification,  not  the  degree  of  belonging. 

For  example,  the  desired  output  for  non-deceptive  subjects  is  plus  one.  If  the  actual  output 
is  between  zero  and  two,  then  the  crisp  classification  is  non-deception,  which  is  correct. 
The  numerical  difference  between  the  actual  output  and  the  desired  Output  is  less  than  one 
in  this  case,  and  the  squaring  operation  would  lessen  the  significance  of  that  error.  On  the 
other  hand,  if  the  actual  output  is  less  than  zero,  then  the  crisp  classification  would  be 
deception,  which  is  wrong.  In  that  case,  the  numerical  difference  between  the  desired 
output  and  the  actual  output  is  greater  than  one  and  more  significance  would  be  given  to 
such  mistakes.  Similar  argument  can  be  apply  for  the  choice  of  minus  one  as  the  desired 
output  for  deceptive  subjects. 


3.2.3.  Trainings  testing  and  evaluation  strategy: 

The  fuzzy  LMS  algorithm  can  be  optimized  to  a  specific  set  of  data.  To  exploit  that  aspect 
of  the  algorithm,  we  also  selected  a  set  of  data  to  train  the  system.  Following  a  procedure 
similar  to  one  used  in  an  earlier  project  with  KNN  classifying  algorithm  [Layeghil993], 
we  had  35  deceptive  subjects  and  35  non-deceptive  subjects  -  from  each  polydat_i  -  for 


36One  may  also  view  the  output  as  a  fuzzy  value  and  map  it  to  a  confidence  value  in  addition  to  just  a 
deception/non-deception  decision.  That  would  differentiate  a  sure  judgment  from  an  unsure  one  and  may 
be  more  helpful  in  practice. 
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training.  However,  with  a  set  of  only  100  subjects  within  each  polydat_i,  that  left  a  rather 
small  amount  for  testing  (i.e.  15  deceptive  and  15  non-deceptive  subjects).  Therefore  we 
also  tested  the  algorithm  with  10  deceptive  subjects  and  10  non-deceptive  subjects  for 
training  and  the  rest  (40  deceptive  subjects  and  40  non-deceptive  subjects)  for  testing. 
That  might  be  a  bit  extreme  in  the  other  direction,  but  we  could  interpolate  the  results  and 
also  see  the  sensitivity  of  the  algorithm  to  the  amount  of  training  data. 

We  tested  both  cases  for  all  three  polydat_i's,  giving  a  total  of  six  tests.  Each  test  was 
repeated  twenty  times.  The  training  data  were  randomly  chosen  each  time,  and  the  rest  of 
the  available  data  in  each  set  were  used  for  testing.  We  recorded  for  each  test  the  average 
of  those  twenty  trials.  This  repeated  testing  was  done  to  ensure  that  the  results  were  not 
dependent  on  a  particular  choice  of  training  data. 
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3.2.4.  What  to  do  -with  the  memorizing,  problem? 


Most  learning  algorithms  suffer  the  dilemma  of  overlearning,  or  memorizing.  Usually  the 
problem  occurs  when  the  learning  algorithm  tries  too  hard  at  optimizing  itself  to  a  set  of 
training  data,  sometimes  to  the  point  of  memorizing  them,  such  that  it  does  not  generalize 
to  understand  new  data.  Overlearning  is  exacerbated  when  the  training  data  set  is  not 
completely  representative  of  the  testing  set. 

In  a  pattern  recognition  problem,  while  the  recognition  rate  for  the  training  data  may 
increase  steadily  until  it  reaches  a  certain  plateau,  the  recognition  rate  for  testing  data  may 
only  increase  for  a  while,  after  which  it  may  decrease  until  it  hits  a  plane.  We  observed 
such  phenomenon  in  our  system: 


The  training  data  consist  of  twenty  non-deceptive  subjects  and  twenty  deceptive  subjects  from  polydat_3. 

_ The  testing  data  consist  of  all  one  hundred  subjects  from  polydat_J . 


Fig.19:  An  example  for  memorizing  as  the  system  "learns" 

The  point  where  the  recognition  rate  starts  to  decrease  marks  the  beginning  of 
overlearning.  In  practical  applications,  most  adaptive  learning  algorithms  are  trained  only 
to  the  point  before  overlearning  occurs,  when  the  performance  on  the  testing  data  reaches 
its  peak. 


4-42 


In  our  testing  we  had  taken  that  approach  and,  for  each  trial,  the  percentage  of  correct 
recognition  was  taken  as  the  maximum  attained  for  the  testing  data  within  forty  epochs37. 

We  disregarded  the  recognition  rate  for  the  training  data  because  for  many  systems, 
including  our  own,  a  proper  set-up  could  easily  attain  a  recognition  rate  of  100%.  That  is, 
the  recognition  rate  of  the  training  data  bears  little  importance  in  practical  applications. 


37An  epoch  is  defined  as  one  complete  cycle  through  all  the  training  data. 
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§4.  RESULTS  AND  CONCLUSIONS 


4.1.  Fuzzv-c-means 


4JLL  Searchingjor  the  best  level  of  fuzziness  (parameter  'mill 

One  of  the  major  steps  during  the  one-dimensional  clustering  was  the  searching  process 
for  the  best  value  of  m38.  For  this  process,  it  was  necessary  to  run  the  FCM  algorithm  for 
different  m's  and  for  different  data  by  increasing  m  systematically.  This  was  done  for  all 
669  features  and  for  each  polydat_i,  by  every  new  m. 

Recall  that  it  was  decided  to  consider  four  groups  of  features  to  limit  the  feature  pool  for 
multi-dimensional  clustering.  Even  though  the  general  development  -  while  changing  m  - 
was  similar  for  each  polydat_i,  the  individual  reaction  of  these  4  groups  within  each 
polydat_i  was  a  little  different.  For  the  final  decision,  we  considered  all  these  variances, 
correct  detection  rates  and  also  the  distributions  of  the  membership  values  for  each  m. 

In  the  following,  I  will  mention  some  of  the  remarkable  observations  we  have  made  during 
this  process  (see  also  the  following  tables  and  figures  representing  the  results  of 
polydat_3): 

As  expected,  the  membership  values  Uik  did  approach  the  0.5-level39  by  increasing  m,  i.e. 
the  results  became  fuzzier.  Thus,  we  had  to  limit  the  increasing  process  to  avoid  the 
uncertainty  of  the  results  caused  by  too  much  "fuzziness"  (which  means  that  every  person 
belongs  to  both  clusters  with  almost  the  same  possibility).  However,  we  could  observe  a 
very  interesting  phenomenon.  Even  though  the  membership  values  came  closer  to  0.5,  and 
the  distances  for  different  persons  to  this  level  were  around  10~x  (with  x>  3),  they  were 
still  visually  recognizable  as  deceptive  and  truthful  clusters. 

See  the  following  two  figures  and  also  the  Fig.  14  for  examples.  Notice  that  the  first  50 
sessions  represent  the  non-deceptive  persons  and  the  other  50  the  deceptive  ones. 


38See  also  chapter  3. 1.3.3.  for  the  discussion  about  finding  the  best  TYl. 
39See  chapter  2.2.3.2.  for  more  details. 
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membership  values 


"  represents  the  membership  values  for  m=10 
”+n  represents  the  membership  values  for  m=5 


Fig.20:  Influence  of  increasing ’m’  for  polydat-3  session  #1 


Fig.21:  The  zoomed-in  view  of  the  above  figure  for  m=10 
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In  the  following  two  tables,  the  influence  of  changing  m  (for  polydat_3/group  #1,  as  an 
example)  is  depicted.  As  mentioned  earlier  this  group  represents  those  features  which  give 
us  better  than  60%  right  detection  for  both  deceptive  and  non-deceptive  files  by  one¬ 
dimensional  clustering. 

As  you  see  in  these  examples,  while  increasing  the  parameter  m,  new  "good"  features 
appear.  Some  old  ones  provide  even  better  detection  rates  and  some  get  worse  or  even 
disappear.  This  progress  is  not  unlimited.  As  you  see,  the  development  from  'm=4'  to  'm=5' 
is  smoother  than  between  'm=2'  and  ’m=4'  regardless  of  'm=3'  step.  By  continuing  this 
process  above  'm=5',  the  tendency  becomes  rather  negative. 

Those  features  marked  with  (*)  represent  a  better  detection  rate  than  75%  at  least  in  one 
of  the  two  clusters.  Notice  that  these  features  also  change  during  the  increasing  process  of 
nt.  By  continuing  this  process  above  'm=5',  also  this  tendency  becomes  rather  negative. 

After  considering  the  other  groups40  and  their  development  for  each  polydat_i,  'm=5' 
appeared  to  be  the  best  compromise.  Notice  that  there  is  also  an  outstanding  result  for 
feature  number  452  by  'm=5'  (see  Fig.23).  That  was  the  only  inidmdual  feature  ever  by  an 
one-dimensional  clustering  process  with  a  correct  detection  rate  of  90%  for  non-deceptive 
files. 

Another  interesting  aspect  is  that  independent  of  m,  the  conglomeration  areas  where 
"good"  features  appear  are  always  the  same:  For  example  the  half  of  the  "good"  features 
are  among  the  first  hundred,  but  between  200  and  300,  there  is  only  one. 

In  the  next  tables  we  will  use  the  following  abbreviations: 

Feature  number. 

Wrong  detection  within  the  deceptive  cluster  in  percent. 

Wrong  detection  within  the  non-deceptive  cluster  in  percent. 

Features  with  a  better  detection  rate  than  75%  at  least  in  one  of  the  two 
clusters. 

MINUS  Represents  the  difference  in  detection  rates  by  using  different  m' s. 


ft#: 

w_dcp: 

w_non: 

*• 


40See  Fig.8. 
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polydat_3 

erouD  #1  &  m=4 

’m=2’  MINUS  'm=4' 

ft  #  w_dcp  w_non 

% 

% 

1.0000  24.0000  18.0000 

* 

0 

-2.0000 

3.0000  32.0000  20.0000 

* 

2.0000 

0 

4.0000  22.0000  36.0000 

* 

- new  feature - 

5.0000  30.0000  32.0000 

2.0000 

0 

12.0000  40.0000  30.0000 

- new  feature - 

15.0000  24.0000  18.0000 

* 

0 

-2.0000 

17.0000  32.0000  20.0000 

♦ 

2.0000 

0 

18.0000  22.0000  36.0000 

* 

- new  feature - 

19.0000  30.0000  32.0000 

4.0000 

0 

22.0000  24.0000  28.0000 

♦ 

0 

0 

29.0000  36.0000  18.0000 

* 

0 

0 

30.0000  24.0000  20.0000 

♦ 

-2.0000 

4.0000 

31.0000  28.0000  32.0000 

-2.0000 

0 

33.0000  36.0000  32.0000 

0 

0 

36.0000  30.0000  16.0000 

* 

0 

0 

37.0000  16.0000  26.0000 

* 

0 

4.0000 

38.0000  24.0000  28.0000 

♦ 

0 

0 

39.0000  28.0000  26.0000 

-6.0000 

4.0000 

40.0000  32.0000  30.0000 

0 

0 

50.0000  34.0000  34.0000 

2.0000 

0 

52.0000  30.0000  40.0000 

— 

new  feature - 

68.0000  24.0000  36.0000 

* 

— 

new  feature - 

70.0000  40.0000  40.0000 

-2.0000 

0 

82.0000  32.0000  40.0000 

- - 

new  feature - - 

141.0000  34.0000  34.0000 

-4.0000 

0 

155.0000  34.0000  34.0000 

-4.0000 

0 

176.0000  32.0000  36.0000 

— 

■new  feature — - 

177.0000  32.0000  32.0000 

0 

0 

197.0000  26.0000  32.0000 

0 

2.0000 

200.0000  34.0000  26.0000 

-2.0000 

0 

202.0000  30.0000  28.0000 

0 

0 

211.0000  26.0000  32.0000 

0 

0 

214.0000  32.0000  26.0000 

0 

0 

216.0000  30.0000  28.0000 

0 

0 

235.0000  30.0000  38.0000 

-4.0000 

2.0000 

395.0000  38.0000  32.0000 

-new  feature - 

449.0000  32.0000  20.0000 

* 

0 

6.0000 

450.0000  24.0000  18.0000 

* 

0 

2.0000 

451.0000  24.0000  38.0000 

* 

-new  feature - 

453.0000  36.0000  36.0000 

0 

0 

458.0000  32.0000  26.0000 

6.0000 

i  -2.0000 

459.0000  40.0000  18.0000 

* 

-new  feature - 

460.0000  26.0000  38.0000 

-new  feature - 

462.0000  28.0000  34.0000 

0 

0 

600.0000  36.0000  40.0000 

0 

0 

for  the  abbreviations  see  page  46 

Fig.22:  Comparison  between  the  results  for  ’m=2’  and  ’m-4’ 
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polydat_3 

group  #1  &  m=5 

1 

| 

'm=5' 

ft  #  wdcp  w_non 

% 

% 

1.0000  24.0000  18.0000 

* 

0 

0 

3.0000  32.0000  20.0000 

* 

0 

0 

4.0000  24.0000  36.0000 

* 

-2.0000 

0 

5.0000  30.0000  32.0000 

0 

0 

12.0000  40.0000  30.0000 

0 

0 

15.0000  24.0000  18.0000 

♦ 

0 

0 

17.0000  32.0000  20.0000 

* 

0 

0 

18.0000  24.0000  34.0000 

* 

-2.0000 

2.0000 

19.0000  30.0000  32.0000 

0 

0 

22.0000  24.0000  28.0000 

* 

0 

0 

29.0000  36.0000  18.0000 

* 

0 

0 

30.0000  24.0000  20.0000 

♦ 

0 

0 

31.0000  28.0000  32.0000 

0 

0 

33.0000  36.0000  32.0000 

0 

0 

36.0000  30.0000  16.0000 

* 

0 

0 

37.0000  16.0000  26.0000 

* 

0 

0 

38.0000  24.0000  28.0000 

* 

0 

0 

39.0000  28.0000  26.0000 

0 

0 

40.0000  32.0000  30.0000 

0 

0 

50.0000  34.0000  34.0000 

0 

0 

52.0000  30.0000  40.0000 

0 

0 

68.0000  26.0000  36.0000 

-2.0000 

0 

70.0000  40.0000  40.0000 

0 

0 

82.0000  32.0000  40.0000 

0 

0 

141.0000  34.0000  34.0000 

0 

0 

155.0000  34.0000  34.0000 

0 

0 

176.0000  32.0000  36.0000 

0 

0 

177.0000  32.0000  32.0000 

0 

0 

197.0000  26.0000  34.0000 

0 

-2.0000 

200.0000  34.0000  26.0000 

0 

0 

— feature  #  202  is  missing — 

211.0000  26.0000  32.0000 

0 

0 

214.0000  34.0000  24.0000 

* 

-2.0000 

2.0000 

216.0000  30.0000  28.0000 

0 

0 

235.0000  30.0000  38.0000 

0 

0 

395.0000  36.0000  34.0000 

2.0000 

-2.0000 

449.0000  32.0000  22.0000 

* 

0 

-2.0000 

450.0000  24.0000  18.0000 

* 

0 

0 

451.0000  26.0000  36.0000 

-2.0000 

2.0000 

452.0000  10.0000  40.0000 

* 

- new  feature - 

453.0000  36.0000  36.0000 

0 

0 

458.0000  32.0000  28.0000 

0 

-2.0000 

459.0000  40.0000  20.0000 

* 

0 

-2.0000 

460.0000  28.0000  36.0000 

-2.0000 

2.0000 

462.0000  28.0000  34.0000 

0 

0 

600.0000  36.0000  40.0000 

0 

0 

for  the  abbreviations  see  page  46 

Fig.23:  Comparison  between  the  results  for  ’m=4’  and  ^=5’ 
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£  1.2.  Searching  for  the  best  feature  combination: 


4.1.2. 1.  Results  of  the  conventional  methods  and  general  observations: 

As  mentioned  in  chapter  3 . 1 .3 .2. 1 ,  we  decided  for  three  different  strategies  to  find  out  the 
best  feature  combination  that  can  represent  the  two  sought  clusters  within  the  polygraph 
data. 


After  a  short  while  of  a  "trial-and-error"  testing  with  the  multi-dimensional  clustering 
algorithm  and  achieving  some  experience  about  how  well  which  features  are  in  a 
combination  with  others,  I  decided  to  start  a  systematic  searching  process  beginning  with 
four-tuple  combinations.  In  the  followings,  I  will  mention  some  of  the  general 
observations41  we  made; 

•  not  always  all  of  the  good  one-dimensional  features  were  represented 
within  the  best  feature  combinations, 

•  good  one-dimensional  features  with  the  same  detection  rate  did  not 
provide  the  same  results  within  coequal  combinations, 

•  some  poor  or  average  individual  features  turned  out  to  be  the  best 
features  in  a  combination  with  others, 

•  by  repeating  some  features  in  a  combination,  we  obtained  a  few  new 
good  combinations, 

•  good  feature  combinations  always  gave  us  better  results  than  any  of  the 
features  individually  and 

•  the  quality  of  the  feature  tuple  does  not  depend  on  the  order  of  the 
features  within  the  tuple. 

In  the  following  tables,  you  see  an  example  for  using  the  random  search  method  for 
polydat_3  ('m=2'  and  ’m=5')  for  four-tuple  combinations. 


41See  also  chapter  4.3. 
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feature  number  **  { 1, 4, 3, 9, 22, 29, 30, 36, 37, 39, 450, 457, 458, 460] 

condition:  if  (  (  (m>=80)  &  (ww>=80))  \  (_  (nn>=86)  \  (ww>  =86)  )  ) 


table  1 


feature  nositions 

rieht  detection 

feature  nositions 

right  detection 

non-ok 

dcp-ok 

non-ok 

dcp-ok 

5 

1 

7 

4 

86 

78 

6 

4 

8 

5 

86 

68 

1 

7 

3 

6 

88 

72 

2 

4 

10 

6 

86 

68 

4 

8 

5 

2 

86 

76 

8 

4 

1 

5 

86 

70 

1  5 

6 

8 

4 

86 

68 

10 

8 

2 

1 

86 

72 

8 

3 

4 

5 

86 

72 

7 

9 

3 

1 

82 

80 

6 

8 

13 

5 

86 

68 

8 

1 

6 

14 

86 

70 

4 

1 

6 

3 

88 

70 

5 

4 

2 

8 

86 

76 

2 

3 

6 

1 

86 

74 

1 

7 

8 

6 

86 

70 

1 

8 

5 

3 

86 

72 

1 

4 

8 

10 

86 

72 

6 

12 

13 

8 

86 

68 

2 

12 

8 

1 

86 

76 

8 

1 

4 

6 

86 

70 

1 

2 

4 

8 

86 

76 

8 

7 

6 

1 

86 

70 

8 

1 

2 

4 

86 

76 

1 

8 

5 

6 

86 

70 

7 

3 

4 

2 

86 

78 

6 

3 

7 

1 

88 

72 

4 

1 

6 

8 

86 

70 

2 

6 

10 

1 

86 

68 

3 

6 

1 

4 

88 

70 

6 

10 

2 

7 

86 

68 

8 

1 

5 

10 

86 

72 

1 

3 

6 

5 

88 

70 

1 

8 

2 

4 

86 

76 

6 

7 

3 

1 

88 

72 

8 

4 

13 

1 

86 

70 

2 

6 

4 

1 

86 

72 

1 

10 

2 

6 

86 

68 

7 

5 

1 

4 

86 

78 

1 

6 

3 

5 

88 

70 

5 

8 

1 

4 

86 

70 

1 

5 

8 

3 

86 

72 

8 

5 

13 

3 

86 

72 

3 

8 

2 

6 

86 

72 

3 

8 

6 

14 

88 

70 

1 

6 

3 

14 

88 

70 

3 

7 

4 

2 

86 

78 

5 

1 

8 

2 

86 

76 

8 

7 

1 

6 

86 

70 

1 

4 

6 

10 

86 

68 

3 

1 

6 

5 

88 

70 

2 

5 

4 

8 

86 

76 

5 

4 

8 

2 

86 

76 

2 

6 

10 

1 

86 

68 

... 

feature  number  =  {1, 4, 3, 8, 9, 18, 22, 29, 30, 36, 37, 39, 81, 457} 

condition:  if  (  (  (nn>~80)  &  (ww>-80)) 

|  ((nn>=86)  &  (ww>=78) )  ) 

table  2 

feature  nositions 

richt  detection 

feature  nositions 

right  detection 

non-ok 

dcp-ok 

non-ok 

dcp-ok 

2 

3 

9 

14 

86 

78 

7 

1 

13 

9 

86 

78 

3 

5 

2 

9 

86 

78 

9 

3 

13 

2 

86 

78 

9 

3 

2 

4 

86 

78 

1 

9 

5 

4 

86 

78 

9 

1 

4 

5 

86 

78 

7 

3 

2 

9 

86 

78 

1 

4 

13 

9 

86 

78 

7 

9 

4 

1 

86 

78 

9 

4 

3 

2 

86 

78 

4 

2 

3 

9 

86 

78 

7 

1 

4 

9 

86 

78 

1 

7 

9 

4 

86 

78 

5 

7 

9 

1 

86 

78 

9 

1 

13 

5 

86 

78 

2 

9 

3 

7 

86 

78 

Fig.  24.1:  Feature  combinations  by  ’random  search'  -  polydat_3,  'm— 2’ 
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feature  positions  right  detection 


table  3 


non-ok 

dcp-ok 

8 

7 

6 

1 

86 

78 

7 

8 

1 

5 

86 

78 

3 

2 

8 

6 

86 

78 

3 

8 

5 

2 

86 

78 

1 

3 

10 

8 

82 

80 

3 

8 

2 

6 

86 

78 

3 

2 

13 

8 

86 

78 

2 

8 

5 

3 

86 

78 

1 

6 

5 

8 

86 

78 

5 

8 

3 

2 

86 

78 

1 

8 

13 

5 

86 

78 

6 

1 

8 

7 

86 

78 

2 

5 

8 

3 

86 

78 

5 

2 

3 

8 

86 

78 

3 

8 

6 

2 

86 

78 

3 

7 

2 

8 

86 

78 

2 

8 

5 

3 

86 

78 

7 

6 

1 

8 

86 

78 

3 

5 

2 

8 

86 

78 

8 

5 

6 

1 

86 

78 

7 

2 

3 

8 

86 

78 

8 

5 

6 

1 

86 

78 

7 

8 

2 

3 

86 

78 

7 

8 

6 

1 

86 

78 

8 

1 

7 

6 

86 

78 

1 

8 

5 

6 

86 

78 

1 

7 

6 

8 

86 

78 

5 

8 

1 

6 

86 

78 

6 

1 

5 

8 

86 

78 

7 

8 

5 

1 

86 

78 

8 

7 

2 

3 

86 

78 

8 

2 

3 

7 

86 

78 

6 

5 

1 

8 

86 

78 

1 

8 

7 

6 

86 

78 

6 

7 

8 

1 

86 

78 

1 

6 

13 

8 

86 

78 

6 

8 

13 

1 

86 

78 

8 

7 

1 

6 

86 

78 

5 

1 

7 

8 

86 

78 

2 

6 

8 

3 

86 

78 

3 

2 

8 

7 

86 

78 

1 

6 

8 

5 

86 

78 

2 

5 

8 

3 

86 

78 

8 

1 

5 

7 

86 

78 

2 

5 

3 

8 

86 

78 

feature  positions  right  detection 


1 

8 

10 

3 

non-ok 

82 

dcp-ok 

80 

1 

7 

8 

14 

86 

78 

6 

7 

1 

8 

86 

78 

10 

8 

1 

3 

82 

80 

5 

3 

2 

8 

86 

78 

7 

1 

6 

8 

86 

78 

6 

2 

8 

3 

86 

78 

7 

6 

8 

1 

86 

78 

8 

5 

3 

2 

86 

78 

1 

8 

6 

14 

86 

78 

3 

5 

8 

2 

86 

78 

7 

3 

8 

2 

86 

78 

8 

5 

2 

3 

86 

78 

8 

6 

7 

1 

86 

78 

8 

1 

5 

7 

86 

78 

1 

6 

13 

8 

86 

78 

7 

3 

8 

2 

86 

78 

6 

8 

1 

5 

86 

78 

5 

1 

8 

7 

86 

78 

1 

7 

13 

8 

86 

78 

1 

8 

5 

6 

86 

78 

8 

3 

2 

7 

86 

78 

6 

2 

8 

3 

86 

78 

8 

2 

3 

5 

86 

78 

6 

8 

2 

3 

86 

78 

8 

3 

6 

2 

86 

78 

2 

8 

3 

5 

86 

78 

2 

6 

3 

8 

86 

78 

5 

8 

1 

7 

86 

78 

8 

5 

13 

1 

86 

78 

1 

3 

8 

10 

82 

80 

7 

3 

2 

8 

86 

78 

3 

2 

5 

8 

86 

78 

3 

10 

1 

8 

82 

80 

8 

3 

1 

10 

82 

80 

8 

1 

5 

6 

86 

78 

3 

2 

13 

8 

86 

78 

1 

7 

8 

6 

86 

78 

3 

2 

5 

8 

86 

78 

2 

3 

8 

6 

86 

78 

5 

8 

13 

1 

86 

78 

8 

3 

13 

2 

86 

78 

8 

3 

5 

2 

86 

78 

8 

2 

3 

5 

86 

78 

6 

8 

2 

3 

86 

78 

Fig.  24.1:  Continued 


1  feature  number  =  {1, 4, 3, 8, 9, 21*  22, 30, 35, 36, 81, 198*  457, 459} 

condition:  if((  (nn>-80)&(ww>=80)  ) 

|  ( (nn>=86)  &  (ww>=78)  )  ) 

table  3 

feature  nositions 

right  detection 
non-ok  dcp-ok 

18  5  4 

86  78 

7  1  8  14 

86  78 

7  18  5 

86  78 

4  2  8  3 

86  78 

3  2  8  5 

86  78 

8  14  7 

86  78 

3  4  2  8 

86  78 

8  2  3  7 

86  78 

5  8  13  1 

86  78 

1  4  13  8 

86  78 

feature  number  =  {1, 4, 3, 8, 9, 22, 30, 35, 51,  111,  210, 455, 457, 459} 

condition:  if  (  ( (nn>=80)&(ww>=80)  )  \  ( (nn>=86)  &  (ww> ~79) )  ) 

feature  positions  right  detection 


table  4 


non-ok 

dcp-ok 

7 

5 

10 

6 

80 

80 

6 

4 

7 

10 

80 

80 

7 

4 

10 

5 

80 

80 

Fig.  24.1:  Continued 


feature  number  =  {1, 3, 4, 8, 9, 22, 30, 37, 81,  111,  452, 450, 459, 460} 

condition:  if(( (nn>=8Q)  &  (ww>=80) )  1  ( (nn>-86)  &  (ww>=79)  )  ) 


table  1 


feature  nositions 

right  detection 
non-ok  dcp-ok 

feature  nositions 

right  detection 
non-ok  dcp-ok 

1  12 

5 

9 

86 

80 

8 

5 

1 

2 

80 

80 

5  10 

2 

8 

80 

80 

8 

5 

2 

1 

80 

80 

6  12 

1 

9 

86 

80 

1 

6 

2 

8 

80 

80 

1  9 

7 

5 

86 

80 

10 

6 

2 

8 

80 

80 

10  9 

6 

7 

84 

82 

1 

9 

7 

14 

86 

80 

7  10 

9 

6 

84 

82 

1 

9 

8 

2 

80 

80 

2  1 

5 

8 

80 

80 

5 

12 

9 

8 

80 

80 

10  8 

7 

6 

80 

82 

3 

10 

8 

1 

80 

80 

7  4 

9 

1 

86 

80 

8 

12 

1 

3 

80 

80 

1  8 

2 

4 

80 

80 

1 

4 

8 

2 

80 

80 

1  7 

5 

9 

86 

80 

1 

12 

13 

9 

86 

80 

8  3 

1 

10 

80 

80 

10 

8 

2 

9 

80 

80 

5  8 

1 

2 

80 

80 

7 

9 

6 

1 

86 

80 

8  2 

4 

10 

80 

80 

9 

5 

7 

10 

84 

82 

5  12 

7 

3 

82 

80 

2 

1 

4 

8 
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Fig.  24.11:  Feature  combinations  by  ’random  search’  -  polydat_3,  ’m=5’ 
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Fig.  24.11:  Continued 


After  running  similar  simulations  for  different  m's  with  randomly  chosen  features  from  the 
pool  of  the  aforementioned  five42  groups,  I  started  a  sequence  of  pseudo-exhaustive 
searches  with  those  features  from  which  we  received  good  results  by  random  search. 

For  this  sequence  of  simulations  the  parameter  m  was  set  equal  to  5.  We  started  with 
four-tuple  combinations  out  of  a  pool  of  14  features  (4/14).  We  then  gradually  increased 
the  number  of  the  features  -  within  the  tuple  and  the  pool  -  up  to  8/22.  To  run  the 
simulation  with  this  final  setting,  we  needed  a  computation  time  of  several  weeks. 

In  the  following  figures,  you  see  an  example  for  one  of  the  best  4-tuple  results  we 
obtained  for  the  polydat_3: 

4-tuple  combination:  81  &  111  &  450  &  45243. 

dimension:  polygraph  session. 

correct  detection  rate:  84%  for  non-deceptive  and  86%  for  deceptive  files. 

dimension:  polygraph  examination44  -  containing  1  to  4  sessions. 

correct  detection  rate:  89%  for  non-deceptive  and  94%  for  deceptive  files. 

dimension:  polygraph  examinations  with  more  than  two  sessions. 

detection  rate:  100%. 

42See  Fig.  8  for  four  of  them  and  page  25  for  the  additional  fifth  one. 

43For  information  about  the  exact  meaning  of  these  feature  numbers,  see  Fig.41. 

44See  "Evaluation  strategy"  in  chapter  3. 1.3.4. 
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Fig.25:  Defuzzifled  results  for 
[81-111-450-452]  feature  combination 
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Fig.25:  Continued 
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4. 1.2.2.  Results  of  the  genetic  method: 


Simultaneously  to  the  aforementioned  sequence  of  searches,  I  started  with  a  compromise 
between  the  random  and  the  pseudo-exhaustive  search  method;  i.e.  the  genetic  alternative. 
I  decided  to  use  this  method  in  two  different  ways: 

1 .  In  order  to  increase  the  number  of  potentially  good  features  in  the  pool,  I 
initialized  the  genetic  code  with  up  to  50  features  from  which  (in  different 
simulations)  4-,  6-,  8-tuple  combinations  were  made. 

2.  In  order  to  accelerate  the  search,  but  process  the  data  more  exhaustively, 

I  decided  to  use  the  genetic  code  only  for  the  best  features  from  random 
and  pseudo-exhaustive  simulations  and  narrow  the  feature  pool  to  these 
30  selected  features.  In  this  simulation,  15-tuple  combinations  were  made. 


Recall  that  having  30  or  50  features  in  the  pool  makes  a  big  computation  difference.  For 
example,  choosing  exhaustively  8-tuples  out  of  50  or  30  features  makes  a  difference  of 
following  number  of  computations: 


'50> 

j 


'30^ 

,8> 


50! 


30! 


8!(50  —  8)!  8!(30— 8)! 
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In  the  first  part  of  the  genetic  search  -  as  expected  -  we  had  similar  problems  as  scientists 
have  with  the  theory  of  evolution  as  the  cause  of  our  being45.  The  only  way  we  could  get 
the  following  good  results  was  the  continuous  manipulating  of  the  evolution  process  -  by 
changing  parameters  (like  mutation  rate),  features  (=genes)  and  feature  numbers 
(population  size  and  also  number  of  genes  in  one  chromosome),  or  by  starting  again  if 
the  simulation  began  with  a  very  low  detection  rate  (=average  fitness).  In  spite  of  these 
manipulations  the  first  version  of  the  genetic  search  took  a  simulation  time  of  over  two 
months  of  continuous  computation.  Without  the  constant  controlling  process  over  this 
genetic  system  the  evolution  (by  chance  as  it  is  its  nature)  could  have  hardly  provided  any 
appropriate  improvement46.  As  a  result  we  obtained  12  (see  Fig.26)  8-tuples  combination 


45Further  discussion  about  "evolution  vs.  creation"  would  break  up  the  limitations  of  this  project;  For 
interested  readers  I  recommend  the  following  references:  [Morrisl987]  [Johnsonl991], 

^For  example,  one  of  the  uncontrolled  simulation  for  polydat_l  was  stopped  after  561  generations 
providing  no  particular  results. 
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with  an  average  of  85%  correct  detection  rate  for  polydat_3  similar  to  the  results  of  the  4- 
tuple  combination  mentioned  in  chapter  4. 1.2.1.  We  also  obtained  3  outstanding  (86% 
correct  detection  rate)  individuals  within  three  different  generations  (population  size  of 
200  to  300,  total  number  of  generation  1000,  polydat_3). 


feature  numbers  of  the  best 

8-tuple  combinations 

correct  detection  rate 
ndcp  dcp 

8,30,81 ,81,111 ,363,458,482 

84 

86 
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84 

86 
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84 

86 

9,37,111,  111,235,449,457,460 

84 

86 

37 , 79 ,  111 ,  111 ,  197 , 358 , 449 , 457 

84 

86 

37  ,  111 ,  111 ,  197 , 449 , 457 , 460 , 460 

84 

86 

37  ,  111 ,  111 ,  111 ,  235 , 358 , 457 , 458 

84 

86 

37 ,111  ,  111, 235 , 235 , 449 , 453 , 457 

84 

86 

37,111,111,197,358,361,458,460 

84 

86 

37 , 81 ,  111 ,  235 , 235 , 363 , 450 , 453 

86 

84 

37 , 81 ,  111 ,  235 , 235 , 359 , 450 , 453 

86 

84 

37,79,  111 ,  111 ,  197,235,449,457 

86 

86 

37,111,111,235,235,453,457,460 

86 

86 

37 , 11 1  ,  11 1  ,  197 , 235 , 452 , 457 , 460 

86 

86 

ndcp:  non-deceptive  files 
dcp:  deceptive  files 
data:  poly  dal 

Fig.26:  Results  of  the  first  version  of  the  genetic  search 

Concerning  the  defuzzified  results,  all  the  combinations  with  85%  correct  detection  rate 
show  similar  structure  as  depicted  in  Fig. 25.  The  three  best  8-tuple  combinations  (86 /o 
correct  detection  rate)  cluster  the  data  exactly  in  the  same  groups  as  shown  in  the 
following  figure. 
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Fig.27:  Defuzzified  results  for 
[37-111-111-197-235-452-457-4601  feature  combination 
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Fig.27:  Continued 
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The  followings  are  the  clustering  results  of  the  best  8-tuple  combinations  for  polydat_3: 


dimension: 

correct  detection  rate: 


polygraph  session47. 

86%  for  both  non-deceptive  and  deceptive  files. 


dimension: 

correct  detection  rate: 


polygraph  examination  -  containing  1  to  4  sessions. 
94%  for  both  non-deceptive  and  deceptive  files. 


dimension: 
detection  rate: 


polygraph  examinations  with  more  than  two  sessions 
97%. 


In  the  second  part  of  the  genetic  search  as  we  fed  the  evolution  process  with  the  best 
features,  we  obtained  after  about  3  weeks  of  continuous  simulation  the  following  results: 


twelve  15-tuple  combinations:  (the  features  in  each  tuple  are  ordered  vertically) 
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Fig.28:  Results  of  the  second  version  of  the  genetic  search 


47See  "Evaluation  strategy"  in  chapter  3. 1.3.4. 


4-60 


Fig.29:  Average  fitness  of  each  generation 
provided  by  the  second  version  of  the  genetic  search 


As  you  see  in  this  figure,  the  average  fitness  (from  all  the  chromosomes  within  a 
generation)  increases  over  the  period  of  time.  It  then  approaches  a  local  asymptote  which 
represents  a  local  error  minimum.  By  increasing  the  mutation  rate  after  the  150th 
generation,  we  could  avoid  being  stuck  in  that  local  minimum  for  further  development. 
This  higher  mutation  rate  helped  the  evolution  process  getting  a  1%  better  average  fitness 
per  generation  for  the  rest  of  the  simulation. 

Our  hope  for  this  simulation  was  to  get  outstanding  chromosomes  with  a  very  high  fitness 
simultaneously  to  the  increasing  process  of  the  average  fitness  per  generation.  However, 
the  outstanding  chromosomes  appeared  unsystematically  in  different  generations  and  not 
at  the  end.  In  fact,  most  of  them49  belong  to  the  first  part  of  this  evolution. 

48See  the  begining  of  this  chapter  for  more  details. 

49See  Fig.28  for  the  best  feature  combinations. 
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4. 1.2.3.  Final  results  of  FCM-  A  comparison  between  all  three polydat_i's: 

All  the  aforementioned  results  belong  to  the  data  set  polydat_3,  and  all  the  three  methods, 
(1)  previous  researches  using  the  fuzzy  K-nearest  neighbor  (KNN)  classifier,  (2)  the  LMS 
fuzzy  adaptive  filter  and  also  (3)  the  fuzzy-c-means  algorithm  show  that  the  data  structure 
within  the  polydat_3  is  better  to  cluster  or  classify  than  the  other  two  sets. 

As  it  is  the  nature  of  a  clustering  versus  a  classifying  method,  I  did  not  set  the  highest 
priority  on  finding  the  same  best  features  for  all  three  polydat_i's,  but  for  each  of  them 
individually.  After  finding  those  best  combinations,  I  then  compared  the  results  and  tested 
the  consistency  of  the  features  (see  Fig.  33,  34,  35). 


Using  either  sessions  or  examinations 50  as  the  counting  dimension  the  best  results  for  each 
polydat_i  individually  are  shown  in  the  following  figures. 


data 

average  correct  detection  rate 

polydat_l 

81% 

polydat_2 

79% 

polydat_3 

86% 

Fig.30:  Clustering  results  using  individual  features 
(using  sessions  as  the  counting  dimension) 


data 

average  correct  detection  rate 

polydat_l 

91% 

polydat_2 

82% 

polydat_3 

94% 

Fig.31:  Clustering  results  using  individual  features 
(using  examinations  as  the  counting  dimension) 

50See  "Evaluation  strategy"  in  chapter  3. 1.3.4. 
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data 

average  correct  detection  rate 

polydat_l 

93% 

polydat_2 

87% 

polydat_3 

97% 

Fig.32:  Clustering  results  using  individual  features 
(counting  only  those  examinations  with  more  than  two  sessions) 

In  the  following  figures,  a  comparison  between  the  three  polydat_i's  were  made  using  the 
best  feature  combination  for  one  of  the  polydat_i's  at  a  time  and  testing  it  for  the  other 
two  ones.  As  you  will  see,  the  best  result51  -  while  taking  the  same  features  for  each 
polydat_i  -  is  79.7%  for  the  feature  combination52  [9,  30,  81,  197,  478,  111],  and  in 
average  79.3%. 


feature  tuple 

37,  79,  1 1 1,  1 1 1,  197,  235,  449,  457 
37,  111,111,  197,  235,  452,  457,  460 
37,  111,  111,235,  235,  453,  457,  460 

30.81.81,  111,  197,  458 

9.30.81,  111,  197,  458 

8,  37,  50,  79,  111,  111,  235,  235, ... 
358,  452, 453, 478,  478, 478,  482 


polydat_i 

i=3 

i=2 

i=l 

86% 

77% 

75% 

86% 

77% 

75% 

86% 

77% 

74% 

85% 

79% 

73% 

85% 

79% 

73% 

85% 

76% 

76% 

Fig.33:  Comparison  #1  (dimension:  sessions) 

(taking  some  of  the  best  polydat_3  feature  tuples  and  testing  it  for  the  others) 

For  the  exact  labels  of  this  feature  numbers  see  appendix,  Fig.42. 


51With  polygraph  sessions  as  the  counting  dimension. 
52See  Fig.35,  "Comparison  #3". 
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feature  tuple 

i=l 

polydat_i 

i=2 

i=3 

9,  30,  30,  39,  235,  450 

80% 

75% 

81% 

30,  30,  39,  50, 235,  450 

80% 

75% 

81% 

30,  30,  39,  81,235, 450 

80% 

75% 

81% 

30,  30,  39,  197, 235, 450 

81% 

74% 

82% 

30,  30,  39,  235, 363, 450 

81% 

75% 

81% 

30,  30,  39,  235,  358,  450  ' 

80% 

76% 

81% 

30,  30,  39,  235, 450, 458 

80% 

75% 

81% 

30,  30,  39,  235, 482,  450 

80% 

75% 

81% 

30,  30,  39,  235,  361,  450 

80% 

75% 

81% 

30,  30,  39,  235,  359, 450 

80% 

75% 

81% 

30,  30,  39,  235, 450,  457 

80% 

75% 

81% 

30,  39,  235,  363, 450, 482 

80% 

72% 

83% 

30,  39,  235,  363,  450,  478 

80% 

71% 

83% 

Fig.34:  Comparison  #2  (dimension:  sessions) 

(taking  some  of  the  best  polydat_l  feature  tuples  and  testing  it  for  the  others) 


4-64 


polydat_i 

feature  tuple 
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i=i 

i=3 

9,  30,  81,  197,478,  111 

79% 

75% 

85% 

9,30,  50,81,  197,  111 

79% 

74% 

85% 

9,30,81,358, 197,  111 

79% 

74% 

85% 

9,  30,81,359, 197,  111 

79% 

74% 

85% 

9,30,81,  197, 457,  111 

79% 

74% 

85% 

30,81,105,111,  197,358 

79% 

74% 

84% 

30,81,105,111, 197, 359 

79% 

74% 

84% 

30,81,105,111,  197,457 

79% 

74% 

85% 

30,81,  105,  111,  197,  459 

79% 

74% 

84% 

30,81,  111,  197,358,  359 

79% 

74% 

85% 

30,81,  111,  197,358,456 

79% 

74% 

85% 

30,81,  111,  197,358, 457 

79% 

74% 

85% 

30,81,111,  197,358, 459 

79% 

74% 

85% 

30,81,  111,  197,359,456 

79% 

74% 

85% 

30,81,  111,  197,359,  457 

79% 

74% 

85% 

30,81,  111,  197,359, 459 

79% 

74% 

85% 

30,  81,  111,  197,  456,457 

79% 

73% 

85% 

30,81,111,  197,456,459 

79% 

74% 

85% 

30,81,  111,  197,  457,459 

79% 

74% 

85% 

30,  105,  111,  197,359, 459 

79% 

74% 

84% 

30,  105,  111,  197,  456,  459 

79% 

74% 

84% 

30,  105,  111,  197,  457,  459 

79% 

74% 

85% 

30,  105,  111,  197, 456, 457 

78% 

74% 

85% 

30,  111,  197,358,359, 459 

78% 

74% 

85% 

30,  111,  197,  358,  456,  459 

78% 

74% 

85% 

30,  111,  197,358,  457, 459 

78% 

74% 

85% 

30,  111,  197, 456, 457,459 

78% 

74% 

85% 

Fig.35:  Comparison  #3  (dimension:  sessions) 

(taking  some  of  the  best  polydat_2  feature  tuples  and  testing  it  for  the  others) 
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4.2.  LMS  fuzzv  adaptive  filter 


The  first  test  we  did,  was  to  find  the  performance  of  the  filter  before  any  training.  That  is, 
we  used  the  classifier  as  a  conventional  fuzzy  logic  system  designed  solely  based  on  the 
four  linguistic  rules  mentioned  above.  The  results  are  listed  in  the  following  table. 


correct  detection  rate  in 


polydatj 

non-deceptive  files 

deceptive  files 

average 

i=l 

70% 

72% 

71% 

i=2 

70% 

76% 

73% 

i=3 

70% 

88% 

79% 

Fig.36:  Results  based  solely  on  4  aforementioned  linguistic  rules 

without  any  training 

Note  that  the  percentage  of  correct  recognition  for  non-deceptive  subjects  are  the  same 
for  polydatj,  polydat_2,  and  polydatj,  because  they  are  all  the  same  data53.  Also  note 
that  the  results  are  best  for  polydatj,  as  it  was  for  KNN  and  FCM.  This  may  be  partially 
due  to  polydatj's  good  performance  in  general,  independent  of  the  classifying  schemes. 
We  believe  that  it  may  also  be  a  result  of  us  setting  up  the  linguistic  rules  by  having 
observed  polydatj. 

However,  the  outcomes  for  polydatj  and  polydatj  are  good  enough  such  that  one  can 
be  sure  the  linguistic  rules  are  sufficiently  general  even  for  data  that  we  did  not  examine. 

As  mentioned  in  chapter  3.2.3,  we  then  tested  the  fuzzy  LMS  algorithm  trained  with 
twenty  training  data  (ten  deceptive  and  ten  non-deceptive)  and  again  with  seventy  training 
data  (thirty-five  deceptive  and  thirty-five  non-deceptive)  for  the  three  sets  of  data,  for  a 
total  of  six  tests.  Twenty  trials  were  performed  for  each  test,  and  the  system  was 
initialized  with  the  linguistic  rules  before  each  trial.  The  training  data  were  randomly 
chosen  for  each  trial,  and  the  rest  of  the  available  data  in  each  set  were  for  testing. 


53See  polygraph  files  on  chapter  6.2. 
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We  computed  the  percentage  of  correct  recognition  of  testing  data  for  each  trial, 
averaging  the  performance  for  deceptive  and  non-deceptive  subjects.  The  recognition  rate 
of  those  twenty  trials  are  averaged,  rounded  to  two  digits,  and  reported  in  the  following 
table.  The  sample  standard  deviations  are  also  shown. 


correct  detection  rate 

polydat_i 

version  #1 

version  #2 

i=l 

75%  (6%) 

73%  (2%) 

i=2 

74%  (7%) 

73%  (3%) 

i=3 

78%  (6%) 

79%  (2%) 

version  #1:  70  training  &  30  testing  sessions 
version  #2 :  20  training  &  80  testing  sessions 
(standard  deviation  in  parentheses) 

Fig.37:  Average  percentage  of  correct  detection  rate 
for  twenty  trials  of  each  test 


As  may  be  expected,  the  recognition  rate  improves  in  general  when  training  data  is  used, 
as  compared  to  the  results  of  the  untrained  system.  Also,  the  recognition  rate  is  typically 
higher  when  the  system  is  trained  with  more  data.  The  difference,  however,  is  not 
dramatic.  The  use  of  training  data  offers  small  incremental  improvements.  The  one 
exception  would  be  for  data  set  polydat_3.  Here  more  training  data  seems  to  lower  the 
performance.  The  effect  is  probably  due  to  the  fact  that  the  initialization  of  the  reasoning 
rules  were  based  on  our  examination  of  polydat_3,  which  covered  all  100  data.  Yet  the 
training  algorithm  was  to  learn  only  a  subset  of  that,  so  it  was  handicapped  compared  to 
human  reasoning. 

Human  reasoning  may  also  be  better  in  this  case  because  the  training  algorithm  only 
attempts  to  optimize  the  system  in  the  least  mean  square  sense,  slightly  different  than  our 
ultimate  goal  of  maximizing  recognition  rate.  At  any  rate,  when  the  standard  deviation  is 
taken  into  account,  the  difference  in  recognition  rate  becomes  insignificant. 

Another  noticeable  difference  between  the  results  using  different  amounts  of  training 
samples  is  the  value  of  the  sample  standard  deviation.  A  large  number  of  testing  data  leads 
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to  a  small  standard  deviation.  Conversely,  a  small  amount  of  testing  data  leads  to  a  large 
standard  deviation.  This  confirms  what  we  intuitively  know;  the  average  percentage  of 
correct  recognition  is  more  accurate  when  a  large  amount  of  testing  data  is  available. 

The  above  observations  illustrate  a  practical  issue  in  using  many  adaptive  and  learning 
algorithms,  that  of  partitioning  a  limited  amount  of  data  into  training  and  testing  sets.  For 
most  algorithms,  too  much  data  in  training  and  little  in  testing  leaves  little  assurance  about 
the  performance  of  the  system.  On  the  other  hand,  too  much  data  in  testing  and  little  in 
training  assures  mediocre  performance  from  the  system. 

More  data  for  both  training  and  testing  would  help,  but  many  times  that  may  not  be 
available.  Fuzzy  logic  systems  mitigate  this  problem  by  exploiting  linguistic  information. 
Unlike  neural  networks  and  many  statistical  techniques,  which  are  completely  dependent 
on  numerical  data,  this  fuzzy  LMS  algorithm  uses  numerical  data  mainly  to  optimize  a 
good  fuzzy  system.  The  above  results  show  that,  given  good  initialization  of  the  reasoning 
rules,  the  system  can  perform  well  even  with  little  or  no  training  data.  This  robustness  is 
one  of  the  many  advantages  of  fiizzy  logic. 
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4.3.  Other  observations: 


During  this  project,  aside  from  the  results  and  conclusions  we  were  looking  for,  we  also 
obtained  several  side  results.  In  this  passage,  I  will  mention  some  of  the  interesting 
observations  we  made. 


1;  As  mentioned  before,  the  fuzzy-c-means  (FCM)  algorithm  is  initialized  by  random 
chosen  membership  values  which  will  be  modified  and  optimized  during  the  iterative 
process.  Thus,  FCM  algorithm  is  almost  independent  of  the  initial  membership  values. 
During  our  testing  process,  we  noticed  that  the  FCM  algorithm  is  not  absolutely 
independent  of  the  initial  values.  Thus,  it  is  possible  that 

•  the  algorithm  may  run  into  different  local  minima  or 

•  because  of  its  unsupervised  nature,  the  algorithm  may  switch  the  clusters, 
i.e.  if  -  depending  on  our  interpretation  -  the  first  cluster  represents  the  non- 
deceptive  and  the  second  one  the  deceptive  files,  it  might  be  the  opposite 
while  using  other  initial  random  values. 

To  avoid  any  misinterpretations,  I  decided  to  create  two  sets  of  random  membership 
values  (for  c=2  and  c=3)  and  save  them  as  fixed  initialization  values  for  any  further 
simulations.  In  the  following  figure, '+'  represents  the  non-deceptive,  the  decptive  files; 


Fig.38:  Fixed  initial  random  membership  values  for  c=2 
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2.  "Outlier  effect": 

In  the  real  world  of  using  an  automated  polygraph  system  as  suggested  in  this  project,  we 
have  to  keep  in  mind  the  existence  of  the  outlier  effect.  This  occurs,  for  instance,  when  a 
non-deceptive  person  (=  membership  value  between  zero  and  0.5)  becomes  misclustered 
in  a  deceptive  data  space  with  a  very  high  membership  value  close  to  one.  In  other  words, 
if  a  normal  non-deceptive  person  gets  labeled  as  very  deceptive,  or  vice-versa. 

We  noticed  this  phenomenon  in  both  clustering  and  classifying  algorithms54.  We  also 
noticed  that  by  making  the  system  "fuzzier"  -  e.g.  higher  m  or/and  c  for  FCM  -  as 
expected,  the  outlier  effect  can  be  reduced,  but  not  eliminated  though. 

3.  "Performance  limitations": 

There  seem  to  be  a  limit  in  recognition  rate  using  the  features  available  by  both  fuzzy 
algorithms  used  in  this  project  and  also  by  fuzzy  k-nearest  neighbor  algorithm  used  in 
previous  works  [Layeghi 1993,1]  [Dastmalchil993]  for  all  the  available  polydat_i's.  There 
may  also  be  psychophysiological  limitation  on  the  recognition  rate.  However,  polydat_3 
provided,  independent  of  all  the  three  algorithms,  the  best  results  compared  to  the  other 
two  polydat_i's. 


54See  also  "Epilogue". 
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4.3.  A  COMPARISON 


BETWEEN  THE  THREE  FUZZY  ALGORTHMS  USED  IN  THIS 
AND  THE  PREVIOUS  PROJECT 

(FUZZY-C-MEANS,  LMS  FUZZY  ADAPTIVE  FILTER  AND  FUZZY  K-NEAREST  NEIGHBOR) 


The  fuzzy  LMS  system  is  unique  in  its  application  of  linguistic  knowledge.  As  mentioned 
earlier,  the  use  of  linguistic  knowledge  ensures  the  robustness  of  the  fuzzy  system.  The 
use  of  linguistic  information  also  ameliorates  the  problem  of  not  having  enough  reliable 
numerical  data.  Unlike  classification  schemes  such  as  the  K-Nearest  Neighbor,  the  fuzzy 
LMS  algorithm  is  not  entirely  dependent  on  numerical  data. 

When  applied  to  pattern  recognition,  fuzzy  logic  systems  can  be  set  up  to  perform  like 
KNN  systems.  In  KNN  systems,  numerical  data  of  known  class  patterns  are  set  up  to 
estimate  the  probability  density  distribution  of  the  classes.  The  probabilities  of  new  data 
points  belonging  to  the  different  classes  are  then  computed  based  on  such  distribution. 
Data  points  around  known  class  samples  are  then  classified  into  the  same  class  with  a 
higher  probability.  The  fuzzy-KNN  algorithm  modifies  the  classical  KNN  algorithm  by 
taking  into  account  the  distance  between  the  data  point  and  the  known  class  patterns  when 
estimating  the  probability.  Conceptually  this  is  similar  to  setting  up  clusters  around  all 
known  class  samples  and  calculating  the  degree  of  belonging  of  new  data  points  in  the 
different  types  of  clusters.  Other  than  the  exact  mathematical  equations,  that  description 
fits  a  fuzzy  adaptive  system  where  each  rule  corresponds  to  a  known  class  pattern  and  the 
size  of  the  clusters  is  the  same  for  all  rules. 

However,  fuzzy  adaptive  systems  give  up  some  of  the  nice  theoretical  understandings  of 
the  KNN  systems  but  gain  some  practical  advantages.  The  number  of  rules  required  are 
usually  much  smaller  than  the  number  of  known  samples.  Fuzzy  logic  can  usually  exploit 
that  to  reduce  system  complexity. 

Furthermore,  the  system  complexity  for  a  fuzzy  adaptive  system  stays  the  same  even  as 
new  information  are  available.  This  is  partly  a  result  of  the  way  this  algorithm  adapt 
continuously;  new  information  are  learned  as  old  ones  are  forgotten.  The  fuzzy  LMS 
learning  technique  is  like  backpropagation,  a  popular  neural  network  training  technique. 
However,  the  fuzzy  LMS  learning  algorithm  requires  few  epochs  for  training.  In  all  our 
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trials  the  maximum  recognition  rates  for  testing  data  peaked  in  less  than  thirty  epochs. 
About  95%  of  them  peaked  in  less  than  twenty  epochs55.  This  is  a  few  orders  of 
magnitude  less  than  most  applications  of  backpropagation.  In  many  cases  the  peaks 
occurred  before  any  training;  that  is,  the  system  uses  only  linguistic  rules.  Here  the  use  of 
expert  knowledge  speeds  up  the  training  of  the  system. 

The  fuzzy-c-means  algorithm,  unlike  fuzzy  LMS,  is  an  unsupervised  clustering  algorithm. 
Given  a  set  of  data,  FCM  looks  for  a  (usually)  predetermined  number  of  clusters  within 
the  data  points.  It  does  not  use  any  knowledge  about  the  correct,  or  desired  classification 
of  any  of  the  elements.  The  algorithm  only  minimizes  an  objective  function,  which  is  the 
sum  of  a  function  of  the  data  points’  membership  values  and  the  distances  between  the 
data  points  and  the  clusters'  centers. 

FCM  operates  like  a  black  box;  given  some  data,  the  algorithm  automatically  computes 
the  results56.  This  presents  the  advantage  that  different  sets  of  data  using  different  features 
can  be  tested  in  a  routine  manner.  FCM  also  presents  a  way  to  normalize  the  different 
dimensions  of  the  data,  just  like  the  use  of  sigma  in  the  fuzzy  LMS  algorithm.  However, 
unlike  fuzzy  LMS,  FCM  does  not  present  a  method  to  find  the  optimal  way  for  such 
normalization. 

The  fuzzy  LMS  algorithm,  however,  does  pose  some  potential  problems  of  its  own.  The 
use  of  expert  knowledge,  while  a  benefit  in  some  senses,  may  not  be  always 
straightforward.  For  example,  in  our  project  we  did  not  have  any  specific  knowledge 
about  the  polygraphy  itself.  Whatever  we  learned,  we  learned  by  looking  at  numerical 
data.  As  we  tried  to  find  more  complicated  patterns,  patterns  involving  three,  four,  or 
more  features,  the  analysis  became  more  difficult.  Naturally  one  wishes  to  automate  this 
process.  If  we  do  not  rely  on  some  learning  procedures,  however,  rules  cannot  be 
automatically  found  for  the  fuzzy  system.  Much  research  also  needs  to  be  done  to 
understand  the  fuzzy  LMS  algorithm's  learning  dynamics.  While  the  same  method, 
gradient  descent,  is  used  on  both  backpropagation  and  the  fuzzy  LMS  algorithm,  the 
general  shapes  of  the  error  surface  between  the  two  are  different.  In  backpropagation,  all 
the  parameters  have  the  same  range  and  lie  in  an  uniform  neural  network  structure.  In  the 
fuzzy  LMS  algorithm,  the  parameters  can  have  different  ranges  and  lie  a  fuzzy  logic 


5 ^However,  we  ran  every  trial  to  forty  epochs  to  ensure  that  there  is  no  false  peak. 
560ur  job  is  basically  to  adjust  the  parameters. 
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structure  that  is  not  completely  uniform.  The  effects  of  such  differences  on  the  shape  of 
the  error  surface  and  the  learning  dynamic  are  unknown. 

In  the  following,  I  will  mention  again  some  of  the  results  we  obtained  by  using  different 
fuzzy  clustering  or  classifying  algorithms.  Recall  that  also  the  searching  strategies  to  find 
the  best  features  -and  feature  combinations-  were  different  for  each  of  the  aforementioned 

algorithms57. 


polydat_i 

hi 

i^2 

i=3 

fuzzy-c-means58 

91% 

82% 

94% 

fuzzy-c-means59 

93% 

87% 

97% 

fuzzy  K-nearest-neighbor 

86% 

80% 

91% 

LMS  fuzzy  adaptive  filter 

81% 

83% 

83% 

fuzzy-c-means60 

81% 

79% 

86% 

The  results  are  rounded. 

Fig.39:  Comparison  between  different  fuzzy  algorithms 
used  for  polygraph  classification  in  this  and  in  the  previous  research 


The  results  of  our  fuzzy  LMS  system,  while  impressive  for  such  a  simple  set-up,  are  not 
comparable  to  the  results  of  the  same  project  using  other  systems.  We  believe  that  the 
recognition  rate  will  increase  for  few  percentage  points  by  using  the  suggestions  in  chapter 


57See  the  following  chapters  3.1.3.1,  3.1.3.2.1  -  4  for  the  searching  strategies  used  for  the  FCM, 

chapter  3.2. 1  for  the  visual  inspection  used  for  the  LMS  system, 

and  chapter  III.3.3.  in  [Layeghi 1993,1]  for  the  methods  used  for  the  KNN. 

58FCM  using  examinations  as  the  counting  dimension  (see  chapter  4. 1.2. 3.  and  Fig.3 1). 
ss>The  same  as  above  but  counting  those  examinations  with  more  than  2  sessions  (see  Fig.32). 

60Since  we  took  35  out  of  50  available  non-deceptive  sessions  for  training  the  LMS  filter,  it  would  be 
meaningless  to  evaluate  this  algorithm  by  examinations  as  the  counting  dimension.  Yet,  in  order  to  make 
it  comparable  to  the  other  algorithms,  the  results  of  the  FCM  with  sessions  as  the  counting  dimension  are 
also  shown. 
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§5.  FUTURE  STEPS  AND  SUGGESTIONS 


5.7.  The  algorithms: 


As  mentioned  earlier  in  chapter  2.2.3.  about  the  fuzzy-c-means  algorithm,  the  performance 
of  this  clustering  model  is  influenced  by  the  choice  of  various  parameters.  In  this  project,  I 
tried  to  find  the  optimum  values  of  the  majority  of  them.  However,  there  are  several  other 
points  which  should  be  studied  more  comprehensively:  They  are 

•  the  initial  cluster  centers, 

•  the  order  in  which  the  samples  are  taken  as  input, 

•  the  choice  of  distance  measure, 

•  the  termination  criteria  and 

•  the  geometrical  properties  of  the  data. 

Most  imprtantly,  more  information  about  the  geometrical  arrangement  of  the  data  points 
and  the  appropriate  choice  of  the  norm  could  help  us  improve  the  clustering  algorithm. 
There  are  several  suggestions  in  [Bezdekl981]  [Bezdekl992]  [IIScorpl993]  for  a  better 
understanding  of  the  algorithm's  dynamics  and  for  making  systematic  decisions  concerning 
different  types  of  distance  norms  and  elliptical  cluster  shapes. 

For  future  studies,  I  highly  recommend  a  deeper  investigation  of  our  clustering  algorithm 
by  setting  c=3  and  trying  defuzzification  thresholds  other  than  0.5. 


In  this  project,  we  decided  to  systematically  test  the  FCM  algorithm  with  different  values 
of  m  to  find  its  optimum.  For  additional  (and  more  theoretical)  investigations,  I  suggest 
[Choel992]  as  an  introductory  step.  It  may  be  also  helpful  to  use  different  values  of  m  for 
different  sessions  simultaneously,  while  looking  for  the  most  realistic  clusters  within  the 
entire  session  space. 

An  exciting  additional  investigation  would  be  a  new  polydat  made  up  of  the  best  clustered 
sessions  of  our  three  polydat_i's  as  a  reference  for  any  further  clustering  process.  By  doing 
this  we  could  give  the  algorithm  a  better  chance  to  cluster  correctly  even  the  critical 


sessions. 


Concerning  the  LMS  adaptive  algorithm,  one  may  investigate  the  effect  of  changing  the 
learning  factor;  throughout  our  experiment  it  remained  at  0.005.  Upon  observing  the 
quickness  of  learning  in  our  testing,  we  believe  the  learning  factor  can  be  decreased  in  the 
future. 

We  also  believe  that  there  should  not  be  just  one  but  at  least  three  different  learning 
factors:  one  for  the  cr's,  one  for  the  0's,  and  one  for  the  x/s;  because  these  three  types  of 
parameters  lie  in  a  very  irregular  parameter  space,  unlike  that  of  backpropagation  where 
all  parameters  lie  in  a  more  or  less  uniform  parameter  space. 

For  illustration,  the  three  types  of  parameters  comapred  to  one  another  have  very  different 
numerical  ranges.  Conceptually  speaking,  a  parameter  with  a  large  range  of  movement 
should  generally  have  a  larger  learning  factor  than  one  with  a  smaller  range  of  movement. 
However,  the  gradient  and  the  general  shape  of  the  error  surface  would  also  affect  the 
value  of  the  learning  factors.  It  is  possible  that  with  a  constant  learning  factor,  a  factor  that 
is  too  large  for  one  type  of  parameter  -  one  that  causes  oscillation  for  that  parameter  -  may 
be  too  small  for  another  type  of  parameter  and  effects  little  change.  That  is,  some 
parameters  become  more  willing  to  adapt  while  others  hesitate  to  change. 

Setting  up  separate  learning  factors  for  the  different  types  of  parameters  should  eliminate 
this  problem.  However,  choosing  a  learning  factor  is  still  a  complex  trial-and-error  task, 
and  having  more  learning  factors  to  deal  with  requires  more  sophisticated  understanding 
of  the  learning  dynamics  we  possess.  Plots  of  the  mean  squared  error  of  two  sets  of 
randomly  chosen  training  data  suggest  that  there  are  noticeable  points  where  the  rate  of 
decrease  dramatically  changes  (see  the  following  figure). 
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Fig.40:  The  influence  of  the  learning  factor 

More  rules  and  features  should  be  added  to  improve  this  LMS  system.  As  the  complexity 
of  the  system  grows,  however,  the  design  will  depend  more  on  the  learning  algorithm  than 
on  heuristic  knowledge.  This  requires  much  more  understanding  of  the  learning  dynamics. 
Preliminary  testing  with  three  features  and  eight  rules  shows  little  improvement  in 
recognition  rate.  Obviously  many  additional  studies  need  to  be  done  in  this  case. 

As  mentioned  in  chapter  "Setting  Linguistic  Rules",  for  future  investigations  one  may  also 
experiment  with  different  decision  thresholds  for  determining  deception  and  nondeception. 
However,  the  benefit,  if  any,  of  this  is  not  clear.  One  may  also  experiment  with  mapping 
the  fuzzy  output  to  a  confidence  value  in  addition  to  just  a  deception/nondeception 
decision.  This  may  be  more  helpful  in  practical  situations.  One  should  also  test  the 
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algorithm  with  random  initializations;  that  is,  without  using  any  expert  knowledge.  It 
would  be  interesting  to  compare  the  training  time,  performance,  and  robustness  of  that 
system  to  the  present  one. 

Fuzzy  logic  systems  promote  rapid  development  of  robust,  simple,  and  reliable  systems. 
Our  project  validated  that  point.  Some  of  the  main  problems  with  designing  traditional 
fuzzy  logic  systems,  however,  are  their  dependence  on  heuristic  information,  their  lack  of 
design  automation  and  their  unproven  ability  to  reach  an  optimal  solution  by  linguistic 
rules  alone.  Our  use  of  the  LMS  learning  algorithm  attempts  to  solve  such  problems.  The 
learning  algorithm  did  offer  small,  incremental  improvements,  but  we  believe  that  the 
learning  algorithm  has  not  yet  been  explored  fully.  A  better  understanding  of  the  learning 
dynamics  would  offer  more  insight  into  improving  the  system. 

In  future  works,  one  may  also  consider  other  strategies  which  use  irrelevant  questions, 
(see  Fig.7).  These  questions  could  be  easily  exploited  for  normalizing  the  data  and  making 
it  independent  of  individual  charateristics  of  the  tested  subjects. 

5.2.  The  polveraph  examination: 

As  expected61,  and  eventually  proven62,  our  clustering  system  can  provide  an  up  to  12% 
more  correct  detection  rate  by  using  the  dependency  between  the  polygraph  sessions. 
Therefore,  I  recommend  recording  at  least  three  -  ideally  five  -  test  sessions  with  different 
a  order  of  questions  per  each  examinations.  Thus,  in  cases  where  some  sessions  within  an 
examination  are  clustered  incorrectly,  the  algorithm  can  easily  ignore  the  minority  and  find 
the  right  cluster  according  to  the  correctly  clustered  majority. 

One  may  also  consider  other  time  frames,  and  emphasize  those  features  which  enabled  us 
to  cluster  the  data  the  best.  It  may  also  be  helpful  to  mark  the  data  of  female  and  male 
subjects,  or  to  consider  them  differently,  since  the  ranges  of  the  biophysical  reactions  are 
not  in  the  same  numerical  spaces. 

Ultimately,  an  automated  polygraph  system  which  uses  the  aforementioned  strategies  to 
distinguish  between  truth  and  deception  should  have  a  built-in  feature  extraction  tool 
which  can  directly  feed  the  needed  data  to  the  algorithm. 


61See  chapter  3. 1.3.4. 
62See  chapter  4.I.2.3. 
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Fig.41:  List  of  labels  of  all  the  features  used  in  this  project 
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max(r)  /  max(c) 

590 

HFUR 

min  cross  correlation 

I 

| 

591 

HFUR 

min  cross  correlation 

ave(r)  +  ave(c) 

592 

HFUR 

min  cross  correlation 

maxOr)  -  max(c) 

593 

HFUR 

min  cross  correlation 

min(r)  -  min(c) 

594 

HFUR 

min  cross  correlation 

max(r)  -  min(c) 

595 

HFUR 

min  cross  correlation 

min(r)  -  max(c) 

596 

HFUR 

lag  min  cross  correlation 

max(r)  /  max(c) 

597 

HFUR 

lag  min  cross  correlation 

f 

! 

598 

HFUR 

lag  min  cross  correlation 

ave(r)  +  ave(c) 

599 

HFUR 

lag  min  cross  correlation 

max(r)  •  max(c) 

600 

HFUR 

lag  min  cross  correlation 

min(r)  -  min(c) 

601 

HFUR 

lag  min  cross  correlation 

max(r)  -  min(c) 

602 

HFUR 

lag  min  cross  correlation 

min(r)  *  max(c) 

603 

HFEC 

spec  HFEC  fund  freq 

max(r)  /  max(c) 

604 

HFEC 

spec  HFEC  fund  freq 

| 

j 

605 

HFEC 

spec  HFEC  fund  freq 

ave(r)  +  ave(c) 

606 

HFEC 

spec  HFEC  fund  freq 

max(r)  -  max(c) 

607 

HFEC 

spec  HFEC  fund  freq 

min(r)  •  min(c) 

608 

HFEC 

spec  HFEC  fund  freq 

max(r)  -  min(c) 

609 

HFEC 

spec  HFEC  fund  freq 

min(r)  -  max(c) 

610 

HFEC 

spec  HFEC  2nd  harmonic 

max(r)  /  max(c) 

611 

HFEC 

spec  HFEC  2nd  harmonic 

| 

| 

612 

HFEC 

spec  HFEC  2nd  harmonic 

ave(r)  +  ave(c) 

613 

HFEC 

spec  HFEC  2nd  harmonic 

max(r)  -  max(c) 

614 

HFEC 

spec  HFEC  2nd  harmonic 

min(r)  -  min(c) 

615 

HFEC 

spec  HFEC  2nd  harmonic 

max(r)-min(c) 

616 

HFEC 

spec  HFEC  2nd  harmonic 

min(r)  -  max(c) 

617 

UR 

spec  UR  fund  frequency 

max(r)  /  max(c) 

618 

UR 

spec  UR  fund  frequency 

| 

i 

619 

UR 

spec  UR  fund  frequency 

ave(r)  +  ave(c) 

620 

UR 

spec  UR  fund  frequency 

max(r)  -  max(c) 

621 

UR 

spec  UR  fund  frequency 

min(r)  -  min(c) 

622 

UR 

spec  UR  fund  frequency 

max(r)  *  min(c) 

623 

UR 

spec  UR  fund  frequency 

min(r)  -  max(c) 

624 

UR 

spec  UR  2nd  harmonic 

max(r)/max(c) 

625 

UR 

spec  UR  2nd  harmonic 

ave(r)  -  ave(c) 

626 

UR 

spec  UR  2nd  harmonic 

ave(r)  +  ave(c) 

627 

UR 

spec  UR  2nd  harmonic 

max(r)  -  max(c) 

628 

UR 

spec  UR  2nd  harmonic6 

min(r)  -  min(c) 

629 

UR 

spec  UR  2nd  harmonic 

max(r)  -  min(c) 

630 

UR 

spec  UR  2nd  harmonic 

min(r)  -  max(c) 

631 

HFUR 

max  cross  spec  density 

max(r)  /  max(c) 

632 

HFUR 

max  cross  spec  density 

ave{r)  -  avefc) 

633 

HFUR 

max  cross  spec  density 

ave{r)  +  avefc) 

634 

HFUR 

max  cross  spec  density 

max(r)  -  max(c) 

635 

HFUR 

max  cross  spec  density 

min(r)  -  minfc) 

636 

HFUR 

max  cross  spec  density 

max(r)  -  min(c) 

637 

HFUR 

max  cross  spec  density 

min(r)  -  maxfc) 

638 

HFEC 

coherency  HFEC&URff 

max(r)  /  maxfc) 

639 

HFEC 

coherency  HFEC&URff 

ave(r)  -  ave(c) 

640 

HFEC 

coherency  HFEC&URff 

ave(r)  +  ave(c) 

641 

HFEC 

coherency  HFEC&URff 

max(r)  -  maxfc) 

642 

HFEC 

coherency  HFEC&URff 

min(r)  -  min(c) 

643 

HFEC 

coherency  H  FEC  &  UR  ff 

max(r)  -  minfc) 

644 

HFEC 

coherency  HFEC  &  UR  ff 

min(r)  -  maxfc) 

645 

HFEC 

coherency  HFEC&URsh 

maxfr)  /  maxfc) 

646 

HFEC 

coherency  HFEC&URsh 

avefr)  -  avefc) 

647 

HFEC 

coherency  HFEC&URsh 

avefr)  +  avefc) 

648 

HFEC 

coherency  HFEC&URsh 

maxfr)  -  maxfc) 

649 

HFEC 

coherency  HFEC&URsh 

minfr)  -  minfc) 

650 

HFEC 

coherency  HFEC&URsh 

maxfr)  *  minfc) 

651 

HFEC 

coherency  HFEC&URsh 

minfr)  -  maxfc) 

652 

GSR 

max  min  ISD  cont&relv 

meanfr  &  c) 

653 

GSR 

max  min  ISD  cont&relv 

maxfr  &  c) 

654 

GSR 

max  min  ISD  cont&relv 

minfr  &  c) 

655 

GSR 

freq  max  ISD 

meanfr  &  c) 

656 

GSR 

freq  max  ISD 

maxfr  &  c) 

657 

GSR 

freq  max  ISD 

minfr  &  c) 

658 

GSR 

area  under  ISD 

meanfr  &  c) 

659 

GSR 

area  under  ISD 

maxfr  &  c) 

660 

GSR 

area  under  ISD 

minfr  &  c) 

661 

HFEC 

max  min  ISD 

meanfr  &  c) 

662 

HFEC 

max  min  ISD 

maxfr  &  c) 

663 

HFEC 

max  min  ISD 

minfr  &  c) 

664 

HFEC 

freq^max  ISD 

meanfr  &  c) 

665 

HFEC 

freq  max  ISD 

maxfr  &  c) 

666 

HFEC 

freq  max  ISD 

minfr  &  c) 

667 

HFEC 

area  under  ISD 

meanfr  &  c) 

668 

HFEC 

area  under  ISD 

maxfr  &  c) 

669 

HFEC 

area  under  ISD 

minfr  &  c) 

Non-deceptive 

Deceptive  1 

Deceptive  2 

Deceptive  3 

QQ8R9OIO.011 

QQ8R9OIO.021 

QQ8R9OIO.031 

QQ95LU1T.011 

QQ95LU1T.021 

QQ95LU1T.031 

QQAURNUS.021 

QQAURNUS.031 

QQAV53P6.011 

QQAV53P6.021 

QQAV53P6.031 

QQBQ4SHI.011 

QQBQ4SHI.021 

QQBQ4SHI.031 

QQBSS7WT.011 

QQBSS7WT.021 

QQBSS7WT.031 

QQ70XM60.021 

QQ7RH0RO.011 

QQ7RH0RO.021 

QQ7RH0RO.03 1 

QQ7R51P9.011 

QQ7R51P9.021 

QQ7R51P9.031 

QQ9TDSP3.011 

QQ9TDSP3.021 

QQ9TDSP3.031 

QQA8OWOI.011 

QQA8OWOI.021 

QQA8OWOI.031 

QQBT22O6.011 

QQBT22O6.021 

QQBT22O6.031 

QQB090  9.011 

QQBO9O_9.021 

QQBO9O_9.031 

QQBC7PP6.011 

QQBC7PP6.021 

QQBC7PP6.031 

QQCHCK  0.011 

QQCHCK  0.021 

QQCHCK  0.031 

QQCDTKP0.01 1 

QQCDTKP0.031 

QQCDTKP0.041 

QQCM5Y56.011 

QQCQQT8Y.011 

QQCQQT8Y.021 

QQCQQT8Y.03 1 

QQCQQT8Y.041 

QQ4Q1O83.011 

QQ4Q1O83.021 

QQ4Q1O83.031 

QQ4Q3MDC.011 

QQ4Q3MDC.021 

QQ4Q3MDC.031 

QQ51DE36.011 

QQ51DE36.021 

QQ51DE36.041 

QQ6RQGH6.011 

QQ6RQGH6.021 

QQ6RQGH6.03 1 

QQ6RQGH6.041 

QQ6T7 110.011 

QQ6T7 110.021 

QQ6T71 10.031 

QQ6Z59IG.011 

QQ6Z59IG.021 

QQ6Z59IG.031 

QQ7PP9B9.011 

QQ7PP9B9.021 

QQ7PP9B9.031 

QQ7PDU1X.011 

QQ7PDU1X.021 

QQ7PDU1X.031 

QQ7  PIPF.011 
QQ7PIPF.021 

QQ7  PIPF.031 

QQ7  JT70.011 
QQ7_JT70.021 

QQ7  JT70.031 

QQ738DYX.011 

QQ738DYX.021 

QQ738DYX.031 

QQ75ULP9.011 

QQ75ULP9.021 

QQ75ULP9.031 

QQ79  EYF.011 

QQ 79  EYF.021 

QQ79  EYF.031 

QQ7BGDML.011 

QQ7BGDML.021 

QQ7BGDML.031 

QQ7ETC81.011 

QQ7ETC8I.021 

QQ7ETC8I.031 

QQ7JAQCS.011 

QQ7JAQCS.021 

QQ7JAQCS.031 

QQ7LX5Q0.011 

QQ7LX5Q0.021 

QQ7LX5Q0.031 

QQ7MN2Y0.011 

QQ7MN2Y0.021 

QQ7MN2Y0.031 

QQ7TC5UF.011 

QQ7TC5UF.021 

QQ7TC5UF.031 

QQ7TQVER.011 

QQ7TQVER.021 

QQ7TQVER.031 

QQ7TVADC.011 

QQ7TVADC.021 

QQ7TVADC.031 

QQ7U2T4R.011 

QQ7U2T4R.021 

QQ7U2T4R.031 

QQ7YP7QU.011 

QQ7YP7QU.021 

QQ7YP7QU.03 1 

QQ7YZOJ3.011 

QQ7YZOJ3.021 

QQ7YZOJ3.031 

QQ8_0DPT.011 

QQ 8  0DPT.021 

QQ8  0DPT.03 1 
QQ8_0DPT.041 

QQ8  2UQ9.011 

QQ8  2UQ9.021 

QQ8_2UQ9.03 1 

QQ800IG6.011 

QQ800IG6.021 

QQ800IG6.03 1 

QQ82OIU9.011 

QQ82OIU9.021 

QQ82OIU9.031 

QQ82SUTX.011 

QQ82SUTX.021 

QQ82SUTX.031 

QQ860ZNU.011 

QQ860ZNU.021 

QQ860ZNU.031 

QQ89U_ZR.011 

QQ89U_ZR.021 

QQ89U  ZR.031 

QQ8ATU26.011 

QQ8ATU26.021 

QQ8ATU26.031 

QQ8FGMVI.011 

QQ8FGMV1.021 

QQ8RAJ0C.01 1 

QQ8RAJ0C.021 

QQ8RAJ0C.03 1 

QQ9EUKVT.011 

QQ9EUKVT.021 

QQ9EUKVT.031 

QQ9IOOXO.021 

QQ9IOOXO.041 

QQ9SOW8L.011 

QQ9SOW8L.021 

QQ9SOW8L.031 

QQ9SQEK9.01 1 

QQ9SQIK9.021 

QQ9SQK9.03 1 

QQ9W0B9F.01 1 

QQ9W0B9F.031 

QQ9W0B9F.041 

QQ9U4FMU.011 

QQ9U4FMU.02 1 

QQ9U4FMU.03 1 

QQ9Y  SW.011 

QQ9Y  SW.021 

QQ9Y  SVF.031 
QQ9YH3QF.01 1 
QQ9YH3QF.021 
QQ9YH3QF.03 1 
QQA2TT4C.011 
QQA2TT4C.021 
QQA2TT4C.031 
QQA3HTRX.01 1 
QQA3HIRX.021 
QQA3HIRX.031 
QQA32UTF.01 1 
QQA32UTF.021 
QQA32UTF.031 

QQA6U  IF.01 1 

QQA6U  IF. 031 

QQA6U  IF. 041 
QQAM4E3L.01 1 
QQAM4E3L.021 
QQAM4E3L.03 1 
QQARF2  X.011 
QQARF2_X,021 
QQARF2_X.03 1 
QQAWA38X.011 
QQAWA38X.021 
QQAWA38X.031 
QQAYXZGU.01 1 
QQAYXZGU.021 
QQAYXZGU.031 

Fig.42:  List  of  polygraph  files  used  in  this  experiment 
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6.3.  USER  INTERFACE 


For  an  automated  polygraph  system  as  a  real  product,  the  existence  of  an  user-friendly 
interface  is  unavoidable.  MATLAB  software  environment  provide  an  easy-to-use  toolbox 
for  creating  various  kinds  of  interactive  interface  classes.  The  following  figure  shows  an 
interface  used  in  one  of  my  representations.  This  was  made  for  a  technically  oriented  user 
who  is  familiar  with  the  algorithm.  A  simpler  black-box  version  of  a  polygraph  system, 
appropriate  to  the  user's  requests,  can  likewise  be  programmed. 


Fig. 43:  An  example  for  a  technical  user  interface 


6.4.  PROGRAM  LISTINGS 

(Implementation  in  MATLAB) 
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%  THIS  PROGRAM  CALCULATE STHE  CLUSTER  CENTERS  FOR 
V.  A  MULTIDIMENSIONAL  FCM  -  C=2,  CONST. 

function  V  -  c_center(X,  U,  m) 

[colE,  rowE]  ”  sizeQO; 
k=l:rowE; 

Hforthe  1th  class: 

Vl_numerator  ^UG^)*!!!  *  XC.k)'; 

%  (.•>=><"):  because  the  "numerator  sum"  is  automatically 
%  included  within  the  matrix  multiplication. 

V(l,:)  «  V  ^numerator  /  sum(  UOJ0  .*m ); 

%  V(1 .:)  [and  V  ^numerator)  is  a  n-dimensional  row-vector, 

%  n  represents  the  number  of  the  clustering  festures(n=30). 

%for  the  2nd  class: 

V2_numeretor«U(2,k).Am  *  X(:tf  , 

•/.  (*>“>(•):  ...*ee  above. 


V(2,:)  *  V2_numerator  /  sum(  U(2,k)  .Am  ); 

%  This  is  a  n-dnnensional  row-vector  and  the  duster-center 
V.  of  the  2nd  class. 


V=V;  %  [nxc]  matrix 

return; 


%  FUZZY  C-MEANS  ALGORITHM  FOR  MULTI-DIMENSIONAL  FCM. 

%  function  bestJJik  -  fc_means(m,  epsilon,X) 
function  [bestJJik,  z]  -  fc_means(m,  epsilorOO 
•/.function  bestJJik  =  fc_means(m,  epsilon) 

•/.function  [bestJJik,  V,  X)  =  fc_means(m,  epsilon) 

%  think  about  the  X 

load  init  u,  %  start  with  tire  initialization  of  the  memb_fct 

%  (Uik  =>  Vi) 


•/.  load  initjr, 


•/•  or  with  the  cluster  centers 
•/.(Vi  =>Uik) 


•/.load  set3 1 ;  %  including  the  data  X  respect  X  l ,  X2,  ... 

•/*X=featmat; 

•/.load  set3me  ,X=Xselect; 

•/.format  long;  %  avoid  errors  by  visual  comparing  the  numbers 

J_m  =  1 00000000,  %  to  make  sure  the  start  is  o  Jc. 

z=0, 

while  J_m  >  epsilon 


V  =  c_center(X,  U,  m); 


U  -  memb_fct(X,  V,  m); 


Jtemp  =  J_m, 

J_m  =  j_mdim(X,  V,  U,  m); 

if  epsilon  <=  0.000005 

if  (abs(J_m  -  Jtemp)  <=  .00000000001), 

Sif  J_m  —  Jtemp,  %  to  terminate  the  loop  by  reaching 

%  the  minimum  of  J_m. 

break, 

end 

else 

if  (abs(J_m  -  J temp)  <*  .000 1 ),%-— o k. 

Vof  J_m~—  Jtemp,  %  to  terminate  the  loop  by  reaching 

%  the  minimum  of  J_m. 

break, 

end 

end 


I 


V.  t  -  abs(U  -  temp);  54  tolerance  value  for  the  iteration 

z*z+l; 

ifrem(z,10)-=O 

fprintfC.VO; 

else 

fprintff.  *); 

end 

end 

fprin  tfljri)  ;fprintfC _  W)l 

bestJJik  *=  U, 
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%Vnew  -  V; 

%  recall  the  extrem  values:  J_m  =7.2308e+003 
return; 

•/.THIS  PROGRAM  CALCULATES  THE  OBJECTIVE  FUNCTION 
%  FOR  THE  MULTIDIMENSIONAL  FCM. 

function  J_m  «  j_mdim(X,  V,  U,  m) 

(colE^owE)  -  size(X), 
k*=  l^owE; 

V.forthe  1th  class: 

VlasMatrix  -  V(:,l)*ones(l,rowE); 

tempi  -  ( X(:Jc)  -  VlasMatrix  y  *  (  X(Jc)  *  VlasMatrix ); 

templl -(  (U(l,:)./'m)  .*  (diagCtempl)')  ); 

J_out  1  =  sum(templl); 

%for  the  2nd  class: 

V2asMatrix  -  V(:>2)*ones(l>rowE); 

temp2  -  (  X(: *)  -  V2asMatrix )'  *  ( X(: J0  *  V2asMatiix  ); 

temp22  *=  (  <U(2I:V'nO  *  (diag(temp2)')  ); 

J_out2  =  sum(temp22); 

J_m  “  J_outl  +  J_out2; 
return; 

V.  THIS  PROGRAM  CALCULATES  THE  MEMBERSHIP  VALUES  FOR 
%  THE  MULTIDIMENSIONAL  FCM. 

function  U  =  memb_fct(X,  V,  m) 

[colE,rowE]  =  size(X); 
k*  IjowE; 

%forthe  1th  class: 

VlasMatrix  *  V(:>l)*ones(lIrowE); 

•/•  to  avoid  time -crunching  for-loops 

temp  1  -  ( X(:,k)  -  V 1  asMatrix  )'  *  (  X(:,k)  -  VI  asMatrix  ); 

%  trick:  matrix -operation  is  Caster, the  sought  norm  is 
%  automatically  the  diagonal  of  tempi ; 

U_num(l,k)  =  (  diagCtempl)’ )  .A  ( -l/(m-l>), 

•/.for  the  2nd  class: 

Y2as  Matrix  =  V(:,2)’ones(l  ,rowE); 

%  to  avoid  time -crunching  for-loops 

temp2  =  ( X(:,k)  -  V2asMatrix )’  *  ( X(:Jc)  -  V2asMatrix); 

%  see  above 

U_num(2,k)  -  (  diag(temp2), )  A  ( -I/(m-l)  ); 


*/%  to  avoid  time-crunching  for-loops 

%  trick  matrix-operation  is  faster, the  sought 
%  automatically  the  diagonal  of  tempi; 


%  to  avoid  time-crunching  for-loops 
•/•  see  above 


U(!,:)~U_num(l,k)  J  (  U_num(l,k)+  U_num(2Jc)  ); 

U(2,:)  =  U_num(23c)  J  (  ulnum(U)  +  U_num(2jc)  ); 

%  If  there  is  a  third  class,  *  U_num(3,k)  ...  ■ 

%  must  be  also  considered. 

return; 

%  FAST  MULTIDIMENSIONAL  EVALUATING  PROGRAM 
clear  best_Uik; 

% - without  plots 

best_Uik  =  fc_means(5, 0.0000005,  Xselect); 


figure(l);clg;hold  on; 
ss=l:100; 

^ot(ss,best_Uik(l,:):+’)^>lot(ss,best_U^k(2,:);•b,); 
%plot(ss,best_Uik(3,:V#  b1) 


wrong_dcps  =  0; 
wrong_nons »  0, 
figure(2);clg;hold  on; 
forr=T:100 

if  best_Uik(2,s)>=.5 
plot(s,best_Uik(2,s),,*b'); 
ifs>50 

wrong_dcps=wrong_dcp$+ 1 ; 
end 
else 

plot(s,best  Uik(2Fs),'+’); 

if  s<=50 

wi  ong_nons=wrong_nons+ 1 ; 
end 
end 
end 

wpercent «  wrong_dcp$/50*  100; 

•/.fyrintfl>Tong_dcps,  percent') 

•/.[wrong_dcps,  wpercent] 
npercent  *  wrong_nons/50*  100; 

•/«  fyrintf^Vrong^nons,  npercent') 

%[wrong_nons,  npercent] 

nn=<100-npercent); 
ww=<l  00- wpercent); 

fyrintfCVn^lpnntfrRlGHT  DETECTIONS:'); 
j^rintf^nOifprintfOn^^iintfCHl^lust  D_clusf); 

[nn  ww], 


%  USER  INTERFACE 

%  Program  B1 .  This  program  creates  the  start  button. 

figure(l);clg; 
set(gcC'color\[l  0  1]) 

button  1  *=uicontroI(gcC... 

’style', *push't... 

*position',[195  150  75  75],... 

'stringVSTART,... 

*callback7bt_choic') ; 


•/.  USER  INTERFACE 

*/«Program  B2.  This  program  displays  choices  to  run  the  various  programs. 


elf  reset 

setCgcC'colo^lO  0  1]) 

titleCONE-DIMENSIONAL  MULTI-DIMENSIONAL’) 

axis  off 

&m2  =  uicontroKgcf... 

‘styleVtexf,... 

•position’,125  40  155  200]); 

tt2  ■=  uicontrol(gcC... 

‘style’, text1,,.. 

’string’.TEATURE  ELIMINATION1,... 

'position' J 25  215  155  40ft; 

frm4  «■  uicontrol(gc£... 

’styte'.'frame',... 

•position', [25  270  155  70Di 

tt4  «  uicontrol(gcC ... 

'style'.text',... 

’stiing'/FUZZY  C  MEANS  WITH  EVALUATION1,... 

•position’,[35  288  125  45D; 


button3  «  uicontrol(gc  f; . , . 
’style’.'push',... 

•position*, [38  275  125  25],... 
’string’, INITIAL  TEST,... 
'callback',1 ’megajsf) ; 

fhn  *uicontrol(gc£... 
’styleV&ame’,... 

’position', [205  40  95  185]); 


tt  *  uicontrol(gcC.. 
'styleVtexf,... 


'string* /POLYGRAPH  DATA’,... 
'position', {20 7  165  85  40J); 

button  1 3  =  uicontrol(gcf;... 
‘style’/push’,... 

•positiori,[210  75  80  25],... 
'string'.'DATA  3’,... 
‘callb&ck.load  ftx3); 

button  14  -  uicontrol(gcC... 
•style'.'push',... 

•position’, [2 10  105  80  25],... 
'string'.'DATA  2*,... 

•callback’, load  ftx2'); 

buttonl5  *  uicontrol(gc£... 
'style'/push',... 

•position' ,[  2  10  135  80  25],... 
'string’.'DATA  1\... 
'callback/load  ftxT); 


button  16  =  uicontrol(gc£,- 
'style', 'push',... 

*positiori,[210  45  80  25],... 

'string', 'CLEAR',... 

'caBback/clear*); 

button!7  -  uicontrol(gcC-.. 

'style'/push',... 

•position',  [4  5  200  125  25],... 

'string*/BOTH  >60%*,... 

'callback1, ‘megaj'); 

buttonl8  =  uicontrol(gcC-- 
'style'/push',... 

"positiori,[45  150  125  25],... 

'string1, *>80%  AND  >50%',... 

•caHback,'mega_ii‘); 

button  19  =  uicontrol(gc£... 

'style'/push',... 

•position’, [45  100  125  25],... 

'string*/>50%  AND  >80%',... 

'callback/megajii'); 

button20  =  uicontrol(gcf/.. 

'style'/push',... 

"position', [45  50  125  25],... 

'string*, 'ONE  >98%',... 

'callback/megaJY), 

frm3  =  uicontrol(gc{;... 

'style', 'frame',... 

“position*  ,[320  40  165  185D, 

tt3  =  uicontrol(gcC... 

'style', text*,... 

•string1, 'SEARCH  FOR  BEST  COMBINATION',... 
"position',] 350  1  50  120  65]), 

button21  =  uicontrol(gcC~. 

'style', ‘push’,... 

"position1, [318  230  192  25],... 

'string’/FEATURE  COMBINATION1,... 

'callback, "ini  tfasti); 


frm5  «  uicontroKgcC... 

'ityle', 'frame',..  . 

•position', [3 18  260  140  85p; 

tt5  »  uicontrol(gcC... 

'style', text*,.., 

'string*, “FUZZY  C  MEANS  WITHOUT  EVALUATION1,... 
•position', [332  275  115  65]), 

button  4  -  uicontrol(gcC... 

'style'.'push',... 

“position',] 32 5  265  125  25],... 

1  s  tring' , '  ALGORFTH  M1 , . . . 

'callback, 'fc_means')( 

button22  «  uicontroKgcC... 

'style'/push1,... 

"positiori,[337  125  100  25],... 

'string*, 'GENETIC',... 

'call  back, ’genetic4'); 

button23  -  uicontrol(gc£,... 

'style'.'push',... 

"poeitiori,[337  95  100  25],... 


'string', 'RANDOM’,,.. 

’callback', ‘random’); 

button24  -  uicontrol(gcC... 
’style'i'push,,... 

•position’, [337  65  145  25],... 

•string', TSEUDO-EXHAUSnVE’,... 
’callback', ,fcature4’); 


%  THIS  PROGRAM  COMPARES  RESULTS  BY  DIFFERENT  SET-UPS 
%  OF  THE  ‘m’.  AN  EXAMPLE: 

w_comp~ZCTos(l  ,669); 
n_cotnp=2cros(l  ,669); 

indeed  3  5  15  17  19  22  29  30  31  33  36  37  38  39  40  50); 
selindex=l:17; 


w_conap(mdex)  -  selw_peTcent(selmdex)  -  w j>ercent(index); 
n_co*np(index)  =  seln_percent(selindex)  -  n_percent(mdex), 

Rindex={70  141  155  1  77  197  200  202  211  214  216  235  449  450  453  458  462  600]; 
schndex=  18:34; 

w_comp(Rindex)  *=  sehv  _percent(selindex)  -  w_percent(Rindex), 
n_oomp(Rindex)  =  seln_percent(seEndex)  -  n_percent(Rindex); 


•/•for  11  newis; 

newindaces={4  12  18  52  68  82  176  395  451  459  460]; 
w_comp(newin  dices)  =  w_percent(newindices); 
n  comp(newindices)  *=  n_j>ercent(nCTr*in dices), 

in={l  3  4  5  12  15  17  18  19  22  29  30  31  33  36  37  38  39  40  50  52  68  70  82  141  155 ... 
1 76  1 77  197  200  202  21 1  2 1 4  2 16  235  395  449  450  451  453  458  459  460  462  600]; 


[in  jn2w_percent^n2n_percent;w,w_comp0n)  ^_compCm)]' 


•/•  ANOTHER  EXAMPLE: 

w_comp=zeros(l  ,669); 
n_comp=zeros( 1 ,669) ; 

index={l  3  4  5  12  15  17  18  19  22  29  30  31  33  36  37  38  39  40  50  52  68  ... 
70  82  141  155  176  177  197  200  211  214  216  235  395  449  450  451]; 
seHndex=l:38; 

w_comp(mdex)  =  selw_percent(selindex)  -  w__peroent(tndex); 
n_comp(index)  *=  $eln_percent(selindex)  -  n_percent(mdex), 

Rindex=[453  458  459  460  462  600]; 
selindex=40:45; 

w_comp(Rindex)  =  selw_percent(selindex)  -  w_percent(Rindex); 
n_comp(Rindex)  =  selnj5ercent(selmdex)  -  n_percent(Rindex); 


•/ofor  1  newy; 
newindices=[452]; 

w_comp(newindices) L  w_percent(nevrindices); 
n_comp(newindices)  “=  n_percent(newindices), 

in=[l  3  4  5  12  15  17  18  19  22  29  30  31  33  36  37  38  39  40  50  52  68... 

70  82  141  155  176  177  197  200  211214  216  235  395  449  450  451  452... 
453  458  459  460  462  600]; 

[in^n2w_percenLm2n_percent;w;w_comp0n)  ;n_comp(in)]' 


%  THIS  PROGRAM  SELECT  AND  EVALUATE  FEATURE  GROUPS 
%  ACCORDING  TO  THE  THRESHOLD. 


dimensiorv=669, 


H); 

for  g=l:dimension 

% - ATTENTION:  Change  parameters  for  m=3... 

ifl;  (n_percent(g)<=40)  A  (w_percent(g)<-40) ) 

1=4+1; 

m2wrong_dcps(T)==svTong_dcps(g), 

m2wj>rrcent0)=rw_percent(g). 


m2w_ok(l)=l  00-m2w_percent(l); 


m  2  wrong_nons(I)=wrong_nons(g) ; 
ro2n_perccnt(l>=n_p«rccnt(g); 

m2n_ok(I>=  1 00-m2n_percent(l); 

m2z(I)=z(g); 

ift  (n_pcrcent(g><*25)  |  (w_percent(g)<«=25) ) 

wO>=i.llll; 

else 

w(I)-0, 

end 

end 

end 

1 

fprintfCm2fl  #,  m2wrong_dcps,  m2w_olc,  m2wrong_nons,  m2n_ok,  m2iterations,  bests'); 

h=lJ; 

feg(K) 

m2wrong_dcps(h) 

m2w_ok(h) 

ra2wrong_nons(h) 

m2n_ok(h) 

m22(h) 

w(h)r 


•/.THIS  PROGRAM  REPRESENTS  ONE  THE  RANDOM  SEARCH 
V.  FOR  4-TUPLE  FEATURE  COMBINATIONS. 

indi=0; 
for  *=1:10000 

aaa  *  round(10*rand(l,4));  •/. . 4*ft-&-size  ofno*=14 

%  ifaaa(l)>=7,  aaa(I)=aaa(l>5;end; 

%  if  aaa(2)>=7,  aaa(2>=aaa(2)-5;end; 

%  ifaaa(3)>=7,  aaa(3)=aaa(3}-5;end; 

ifaaa(l)=0,  aaa(I)=l  l;end; 
if  aaa(2)=0,  aaa(2)=12;end; 
if  aaa(3)=0,  aaa(3)=13;end; 
if  aaa(4)=0,  aaa(4)=14;end, 

while  (  (aaa(l>=aaa(2))  |  (aaa(l)=saa(3))  |  (aaa(2>=aaa(3)) ... 

|  (aaa(2)=aaa(4))  |  (aaa(t)=aaa(4)) ... 

|  (aaa(3)==eaa(4))  ) 

aaa  =  round(10*rand(l,4)); 

V.  ifaaa(l)>=7,aaa(l)=aaa(l)-5;end; 

•/•  if  aaa(2)>=7,  aaa(2)=aaa(2)-5;end; 

*/•  if  aaa(3)>=7,  aaa(3)=aaa(3)-5;end; 

if  &aa(  i  )=<),  aaa(  1  )=  1 1  ;end , 
if  «aa(2)=0,  aaa(2>=12;end; 
if  aaa(3)=0f  aaa(3)=  1 3;end; 
if  aaa(4)=0,  aaa(4)=14;end; 

end, 


indi, 

aaa, 

clear  Xselect; 

Xselect=Xsel(aaa,:); 

initfast; 

% - ATTENTION:  LIMITATIONS - •  %if - 

Sif(  ((nn>=80)  &.  (ww>=80))  |  ( (nn>-=S4)  |  (ww>-84) )  ) 

if  (((nn>=8 1  )&(ww>=S  1 ))  |  ((nn>=86)&<ww>=79)))  % - &Jl  4*ft  x3m5m2 

•/•if ( (nn>=70)  &  (ww>=S0))  % - 4*ftxlm5 

indi=indr+l ; 

*l_combm(indi)  *  aaa(l); 
t2_cornbin(mdi)  =  aaa(2), 
a3_combin(indi)  «=  aaa(3); 

»4_combinCuidi)  -  aaa(4);y. - 4*ft 

n_combres(indi)  “  nn; 
w_combres(indi)  “  ww; 

fyrmtfC»»»»»»»»»»»»»»»>»»'); 

size(aI_coTTibin) 

fyrintfO»»»»»»»»»»»»»»»»»'); 

end 

end 


j=ldndi; 

(al_combin(j) 

t2_combin(j) 


a3_combin(j) 

a4_combin(j) 

n_combres<j) 

w_combre$(j)]‘ 


%  This  program  exhaustively  tests  all  possible  combinations  of 
%  size  eight  in  x3  from  the  number  of  features.  It  then  records  the  ones 
%  that  meet  the  if-then  criteria  below. 

%  clearfinit')  for  normal  initialization. 

load  x3 

features^  1 1 1  450  452  197  459  30  ] 
n=kngth(features) 

for  Mm 

Xse 1(0 : 1 00)=x3(features(f)1 1 : 100); 
end 

if  existCinit>“l 

%  program  continuation.  No  need  to  initialize  other  variables. 
il«=init(l), 
i2=dnit(2); 
i3=init(3); 
i4=init(4); 
i5=init(5); 
i6=init(6); 
i7=init(7); 
i8=init(8); 
else 

%  initialize  all  variables. 
indi=0; 
record=[]; 
il-1; 
i2=2; 
i3=3; 
i4=4; 
i5=5; 
i6=6; 
i7=7; 
i8=8; 
end 


while  il<=n-7 
while  i2<=n-6 
while  i3<=n-5 
while  i4<=n-4 
while  i5<=n-3 
while  »6<=n-2 
while  i7<=n-l 
while  i8<=n 

aaa=(il  i2  i3  i4  i5  i6  i7  i8] 
indi 

clear  Xselect; 
Xselect=Xsel(aaa,:); 

initiast; 


•/. - ATTENTION:  LIMITATIONS - "  %if" - 

ave  =  (nn+wwy2; 

if  (  «nnx=81)&<^>=81))  t  (ave>=83)  ) 
indi=indi+ 1 ; 

record=(record  ;  features<aaa)  nn  ww], 
fprintff  >»»»»»»»»»»»»»»»»»’); 
(features(aaa)  nn  ww] 
end 


i8=i8+I; 

end  end  i8  loop 

i7=i7+ 1 ; 

i8=i7+l. 

end  %  end  i7  loop 

i6=i6+l; 

i7-i6+l; 

i8=i7+l; 

end  end  i6  loop 

i5=i5+l; 

i6=i5+l; 

i7=i6+l, 

i8=T7+l; 

end  %  end  i5  loop 

i4=i4+l; 

i5=i4+l; 

»6=i5+I; 

fH6+l; 
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i8=47+l; 

end  %  end  i 4  loop 

i3=i3+l; 

i4=i3+l; 

i5-i4+l; 

j6=i5+l; 

i7=i6+ 1 ; 

i8=i7+ 1 ; 

end  %  end  i3  loop 

i2=i2+I; 

i3=i2+l; 

i4=i3+l; 

15=44+1; 

i6=i5+l; 

17=46+1; 

i8=47+l; 

end  %  end  i2  loop 

iNl+I; 

12=41+1; 

i3=42+l; 

i4=43+l; 

i5=44+l; 

i6=i5+l; 

i7=46+I; 

i8=47+l; 

end  %  end  il  loop 

record 


•/o  Genetic  algorithm  in  search  of  the  optimal  n-tup!e 
%  from  a  gene  pool  of  features. 

%  This  version  records  the  actual  feature  numbers  in  the 
%  matrix  ’record',  not  the  index! ! 

%  x3.  Set  m  in  initfast. 

•/•  set  init=l  for  automatic  initialization 

comment=,x3,  m=5, 15- tuple.' 

n=15; 

loadx3 

clear  Xselect; 

features=[9  1 1  30  50  39  81  235  358  359  363  449  197  29  450  453  457  458  4  78  ... 

Ill  452  482  361  1 5  36  37  32  8  67  79  460] 

%  features=f4  5  8  9  12  18  19  22  29  30  33  36  39  40  50  56  62  76  79  81  ... 

%  111  114  163  197  235  358  359  361  363  403  449  450  452  453  456  457... 

%  458  477  478  482  534  625  ] 

feature_num=Iength(features) 

for  fr=l:feature_num 

Xselft  l :  1 00)=x3(features(0, 1 : 1 00), 

end 

clear  x3; 

clear  average_fitness; 
ifinit=l 

*/•  initialize  population  size,  crossover  rate,  mutation  rate,  etc. 

populati  on_size=200 ; 

mutation_rate=0.00 1 ; 

crossover_rate=0.7; 

record=zeros(20/i+3); 

indi=0; 

%  initialize  population 
randCuniform'), 

population=fix((feature_num  •  .0000001) ."  rand(population_sizeJn))  +  1; 
end 

%/%  start  evolution 

for  generation^  1 : 1 00000 

generation 

•/«  test  the  population  for  fitness 
for  f  =  1 :  populati  on  _size 
Xselect  ■=  Xsel<popuiation(C:),:);*/* 

initfast;  %  test  each  individual 

fitness(f)  “  abs((nn+wv.’)/2  -  20),  %  subtract  20  to  exaggerate  the 

%  difference  in  fitness  ratio 

%if  (  ((nn>=70)A(ww>=70))  \  (fitness >=56)  |  ((nn<*=20)<&(ww<=20))  ) 
if(  (fitness(f)>=65)  |  ((nn<=20)&(ww<-20))  ) 
indi=indi+l 

record(indi,:)  -=  [  features(popuIati  on(f, :))  generation  rm  ww];% 
(features(population(fi:))  generation  nn  ww] 

end 

end 


%  display  average  fitness  in  percentage 
average_fitness(geneTation)=mean(fitness)  +  20 


%  REPRODUCTION  1! 

•A  reorder  the  fitness  values  for  easier  computation 
fit_measure(  1  H5tnes$(  1 ); 
for  f=2:population_size 
fit_measure(0=fit_measure<f-  l)+fitness(0, 
end 


for  I  :population_size 

•A  randomly  pick  one  individual  to  copy  into  the  new  population 
•/,  individuals  with  higher  fitness  values  are  more  likely  to  survive 
temp=fit_measure(population_size)  .*  rand; 

index*find(abs(fit_measure*temp)  — 5  min(abs(fit_measure-texnp))); 
if  temp  <«*  fit_mfasure(mdex(l)) 

new_population({;:>=populationCmdex(l),:);% 

else 


new_jx>pulation(C:H>opuktion(tndex(l>+ 1,:);% 


end 


end 

population=new_population; 


%  CROSSOVER !! 
f*I; 

while  f<=  population_size 

if  rand  <=  crossover_rete 
mate  =  f; 
crossover  *=  0; 

while  (f  <  populationsize)  &  (crossover=0) 

if  rand  <*=  crossover_rate 
•A  actual  crossover 
crossover  “  1; 

temp=fix((n  -  1 .00001)  .•  rand)  +  2 \*A 
gene_temp=population(mate,ternp:n);% 
populafionCmate.temp^HJopulationCCtcmpjt),*/* 
population(Ctemp-jn^gene_temp; 
end 
end 


end 

f=f+l; 


*A  MUTATION  !! 

•/•  Note:  Modified  Aug.  19  due  to  a  bug 

num_mutationz*population_size  .*  mutation^rate  .•  n  .•  (randn  +  1); 
for  f  =  1  mum_mutation 

population(fix((population_size-0.00000  l).*rand+  l),fix((n-0.0000 1  )*rand+ 1))... 

*  fix((feature_num  *  0.000001)  *  rand  +  1); 
end 

*A  save  record  in  case  of  crashing 

save  crashrec  comment  record  average  fitness 

•/•  go  to  next  generation 
end 

*A  display  record  of  good  individuals  in  history 

comment 

record 

•/.  [sort(record(l  tindi,  1  m))'  record(l  tindi7n+ 1  m+3)] 


•/.  SELECTION  AND  INITIALIZATION  OF  THE  DATA  CENTERS 
•/•FOR THE  LMS FILTER. 

%  *initrain_sess"  ■=■  Polygraph  sessions  which  are  used  for 
%  INITialization  of  the  "datacenters"  and  TRAIN ing 

%  The  ’initrain'  sessions  are  set  in  a  way  that  the  1st  part 
*A  (before  the  "border*)  represents  the  non-deeptive  and  the 
%  2nd  part  (after  the  border)  the  deceptive  sessions. 


clear, 

%•••  To  be  set  foe  each  potydat_i  (ftx3,  ftx2,  ftxl): 

S" 


whichfeatures_3  =  [  1 :30]; 
nondsessions_3  *=[11 :50]; 

V.[l  6  8  9  12  16  18  21  24  27  28  32  35  44  48]; 
depressions  J)  *  [51:90]; 

H[5I  53  58  59  63  67  72  75  82  85  88  89  93  95  100], 

ft 


whichfeatures_2  «  []; 
nondsessions_2  =  []; 
dqpsessions_2  =  []; 


%■ 


whichfeatures_l  *=  []; 
nondsessions_l  •=  []; 
dcpsessions_l  =  [}; 


*  * 


%*  ATTENTION:  The  DIMENSION  of  each  "whichfeatures_..."  is  to  be  equal! 
•  (or  zero) 


iflength(whichfeatures_3)  ~=  length(whichfeature$_2)  j ... 
kttgth(whkhfeatures_2)  Iength(whichfeatuies_I) . 

fpnntffChcck  'which features"  I  They  are  different  big'W); 

IpontfCThe  dimensions  are  as  followingiVn1); 

IprintfCW); 

fyrintfC  1st  2nd  3rchn0; 

disp(  [length(whichfeatures_l),  length(whichfeatures_2), ... 

length(whichjfeatures_3)] ) 

fprintfCtfO, 

frrintfTYOU  DO  NOT  NEED  TO  CHANGE  THE  EMPTY  ONESINn*); 
fyrintfCIF  THAT'S  THE  CASE:  PRESS  ANY  KEY  TO  CONTTNUE.W); 
fydntfC!!!H!nH!!!!t!!I!!!!!!nHU!!!!H!!!!!!nn!\n'); 
pause; 


end; 


border  » length(nondsessions_3)  +  kngth(nondsessions_2) ... 
+  length(nondsessions_l); 


%%%  pdydat_3: 

if  sae(norvise$sions  J3, 1 )  0, 

load  c:\users\ramin\fcm\muhidim\f^ 

dim  =  length(w’hichfeatures_3); 
f=l:dim; 

Ntemp_3(C0  *  x3(whichfeatures_3(l),  nondsessions_3); 
Dtanp_XC')  -  x3(whichfeatures_3(f),  dcpsessions_3); 

ckarx3; 

end; 

%W4  polydat_2 

if  si2e(no(KUessaons_2>l)  0, 

load  c  :\uscTs\rarnin\fcrn\rnul  ti  dim\ftx2 , 

dim  *  length(whichfeatures_2); 
f=l:dim; 

Ntemp_2(f;:)  «  x2(whichfeatures_2(0,  nondsessions_2); 
Dtanp_2(£,:)  -  x2(whichfeatures_2(f),  dcpsessions_2); 

dear  x2; 

end; 

•/,%%  potydatj 

if  size(nondsessions_l,l)~=:  0, 

load  c:\useis\ramin\fcm\muItidiiTi\fbcI; 

dim  -  length(whichfeatures_l); 

£=l:dim; 

NtempJ(C)  =  x  1  (whichfeatures_  1  (f),  nondsessions  _1); 
Dtemp^KCO  *  x  1  (whichfeatures_l  (0,  dcpsessions_l), 

dear  xl; 

end; 

mitrain_ses3  *  [Ntemp_3*;  Ntemp_2';  Ntemp_l'; ... 

Dtanp_3‘,  Dtcmp_T;  Dtemp^l'); 

howntany  -  sjze(initmn_sess,l); 


fneshCmitnm_sess) , 


%  TWO  FEATURES  AT  A  TIME  -  PLOT  EXAMPLE: 
%pk<mitrain_sess(  1 :40, 1)^nitrain_sess(  1 :40,4V  Y) 
•/.hold  on 

%plot(initram_sess(4 1 :80,l)jnntrainjs«ss(4 1 :80,4),' J*) 


V.  SELECTION  AND  INITIALIZATION  FOR  LMS  FILTER. 

%  The  "mitrain"  data  represents  Polygraph  sessions  which  are  used  for 
•A  INTTaEzatian  and  TRAIN ing  of  the  ‘data_centers"  and  input  data 


%  The  "initrain"  data  are  set  in  a  way  that  the  1st  part  -  before  the 
*/•  "(TC  Jborder"  -  represents  the  non-deeptive  and  the  second  part 
%  -  after  the  ’(TCJborder*  -  the  deceptive  sessions. 

%  The  prefix  "nond"  represents  the  non  deeptive,  and  "dep"  the  deceptive 
%  elements. 


clear, 

%••• 


•/•*•**•**••  TO  BE  SET  FOR  EACH  polydat_i  (ftx3,  ftx2,  ftxl): 

%• 


%•  First  for  the  data_centers: 


nondsessions_3  «  [1:20]; 

%[1  6  8  9  12  16  18  21  24  27  28  32  35  44  48]; 
dcpsessions_3  =  [51:70]; 

%[51  53  58  59  63  67  72  75  82  85  88  89  93  95  100]; 


%• 

• 

nondsessions_2  *  Q. 

dcpsessions_2  “  []; 

•/.* 

» 

nondsessions_l  *  []; 

dcpsessions_l  *  (); 

•/.* 

m 

m 

%m  Now  for  the  input  data  for  which  the  filter  is  to  be  (Drained 
•/**  to  (Qlassify: 

•/.“ 

TC_nondse$$ions_3  -  [1:30]; 
TC_dcpsessions_3  *  [51:80]; 


TC_nondsessions_2  *  []; 

TC  dcpsessions_2  =  []; 

TC_nondse$sions_l  *  []; 

TC  dcpsessions  l « []; 

*/.* 

%• 

%•  And  finally  for  the  selected  features: 

%• 

whichfeatures_3  =  [  1 :30]; 
whichfeatures_2  « []; 
whichfeatures_l  *=  [j; 

•/.*  * 

%•  ATTENTION:  The  DIMENSION  of  each  "whichfeatures_...*  is  to  be  equal!  * 
%*  (or  zero) 

. . . . . M 


if  length(whichfeatures_3)  ~=  length(whichfeatures_2)  ] ... 
length(whichfeatures_2)  -■=  length(whichfeatures_l) , 

^trintfT  H !  M !!!!!!  U  !!>•!!!!  1 !  I !  M !!!!  M !!!!!!  1 !!!!! !  INn1); 
fprintfTCheck  "whichfeatures"!  They  are  different  big  T\n<); 
fyrintfCThe  dimensions  are  as  following  An1); 
fprintfiSn1); 

fyrintff  1st  2nd  3rdW); 

disp(  (length(whichfeatures  l),  length(whichfeatures_2), ... 
length(whichfeatures_3)] ) 

fprintfiAn'); 

fprintfCYOU  DO  NOT  NEED  TO  CHANGE  THE  EMPTY  ONES!\n'); 
fyrmtfCIF  THATS  THE  CASE:  PRESS  ANY  KEY  TO  CONTINUE  An1); 
fprintfC !!  I  !1 !!!!  H !!!!!!!!  11 !!! !! !!  U !!!!!!!!!!!!  1 !! !  !\n*); 

pause; 


end; 

border  =  length(nondsessions_3)  +  length(nondsessions_2) ... 

+  kngth(nondsessicms_l); 

TC_border  *=  Iength(TC_nond$e$sions_3)  +  length(TC_nondsessions_2) ... 
+  lengthOTC_nondsessions_l); 


*/•%%  polydat_3: 

dim  *  kngth(w'hichfeatures_3); 
if  dim  0, 


load  cAusers\ramin\fcm\multidim\ftx3; 
f=l:dim; 

if  length(TC_nondsessions_3)  ~=  0, 

TC_Ntemp_3(C0  *  x3(whichfeatures_3(f)>  TC_nondsessions_3); 
end; 

if  length(TC_dcpsessions_3)  —  0, 

TC_Dtemp_*3(C:)  -  x3(whichfeatures_X0.  TC_dcpsessions_3); 
end; 


if  length(nondsessions_3)  ~=  0, 

Ntcmp_XC0  ”  x3(whichfeatures_3(0,  nondsessions_3); 
end; 

if  Iength(dcpsessi  ons  _3)  0, 

Dtemp_3(C0  *  x3(whichfeatures_3(f),  dcpse$sions_3); 
end; 

clear  x3; 

end; 


%%%  po!ydat_2 

dim  “  length(whichfeature$_2); 
ifdim-“0, 

load  c:\users\ramin\fcm\multidiin\Jbc2; 
f=l:dim; 

iflength(TC_nondsessions_2)  0, 

TC_Ntemp_2(f;:)  ■=  x2(whichfeatures_2(f),  TC_nondsessions_2); 
end; 

if  length(TC_dcpsessions_2)  0, 

TC_Dtemp3(C0  “  x2(whichfeatures_2(f),  TC_dcpsessions_2); 
end; 

if  Iength(nondsessions_2)  ~=  0, 

Ktemp_2(C)  ~  x2(whichfeatures_2(f),  nondsession$_2); 
end; 

if  length(dcpsessions_2)  ~=  0, 

Dtemp_2(C:)  **  x2(whichfeatures_2(f),  dcpsessions_2); 
end; 

dearx2; 

end; 


•/•%•/•  polydaM 

dim  *=  kngth(whichfeatures_l ); 
ifdirn  -■=  0, 


load  c:\xism\ramm\fcm\multidim\ftxl; 
f=l:dim; 

if  length(TC_nondsessions_l)  ~=  0, 

TC_Ntemp_l(f,:)  -  x  1  (whichleaturcs_  1  (f),  TC_nondsessions_  1 ); 
end; 

if  lengthCTC_dcpsessions_l)  ~=  0, 

TC_Dtemp_l(C)  =  xl(whichfeature$_l(f),TC_dcpsessions_l); 
end, 

if  kngth(nondsessions_l)  0, 

Ntemp_l(C0“  xl(whichfeatures_l(f),  nondsessions_l); 
end; 

if  length(dcpsessiom_l)  -•=  0, 

Dtemp  KC)  “  xl(whichfeatures_l(f),  dcpsessions_l); 
end, 

dear  xl; 

end, 

TC_initrarn  *=  [TC_Nlemp_3';  TC_Ntemp_2',  TC_Ntemp_l'; ... 

TCJXempJS*;  TC_Dtemp_2‘;  TCJDtemp_r); 

cent  Jnilrain  *  [Ntemp_3*;  Ntemp_2’,  Ntemp_r; ... 

Dtemp_3‘,  Dtemp_2’;  Dtemp_l']; 


%  LMS  FUZZY  ADAPTIVE  FILTER. 

function  [newtheta,  new_data_centers,  new_sigma,  output_label]  *= ... 
adaptzzyftheta,  data_centers,  sigma,  input_vect,  desire,  step) 

%  fprintfCsizeftheta)?)  ;size(theta), 

%fprintfCsize(sigma) : 1 ^isiz^sigma). 


%  Get  the  dimensions  of  matrices  and  verify  their  consistency. 
flabel_no,  ft_no]  =  size(data_c<nteTs); 

if  (JLabel_no,  ft_no]  si2e(sigma))  |  Q 1 ,  ft_no]  -*=si2e(input_vect))  |... 
Qlabd_no,  1  ]  ~=  size<theta)) 

erroTCmatrix  dimensions  are  wrong!  *) 

end; 
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%  Eviluatc  Gaussian  membershipfunctions: 


distances  *  (ones(label_no,I)  *  input_vect)  -  data_centers; 
%fprintitsi2e(distances):')tsize(distances), 

•/•  To  creat  compatible  dimensions:  Fill  input_vect  down  into  an 
%  (label_no  x  ft_no)  matrix,  so  that  it  is  the  same  input  for  all 
%  (Ubel_no)  rules,  and  then  subtract  data_centers  from  it 

a  -  exp(  -0.5  *  sum{  (  (distances  J  sigma)  .*2 )'  y  ); 

•/•  Without  "sum":  a  -  Uik  i.e.  membership  values 
•/•  etc.etc...(conventional  way) 

•/•+++ 

%fyrintfCsize(a):')^i2e(a), 

%  Centroidal  defuzzification: 
b  «  sum(a);V*fjmntfCsize(b):0^i2e(b)1 
output_label  -  sum(theta  .*  a)  /  b; 

*/.+++ 


%  Adaption: 

tempi  *=  step  *  (desire  -  output Jabel)  *  a  Jb\ 
new_theta  =  theta  +  tempi ; 

temp2  =  (  (tempi  .•  (theta  -  outputjabel) )  *  ones(l,  ft_no»  *  ... 

distances  J  (sigma  A2); 
new_data_centeTS  =  datacenters  +  temp2; 

new_$igma  *=  sigma  +  temp 2  distances  J  sigma; 

%+++ 


return; 


%  LMS  FILTER  INTIALIZATION  (TRAINING  AND  TESTING) 
•/.FIRST  VERSION 

V.  dear  everything! 
clear, 

•/.loading 

load  c:\usm\ramin\fcm\multidim\fbt3; 
which_features  *=1:1 00.% - “to  change! ! ! 


V.  the  data  from  the  ‘person*  who  is  to  be  tested: 
person  =  2; 

testperson  *=  x3(which_features,  person)’. 


polysession(l ,:)  «=  xXw’hich_features,l),;%nondecp 
%%%[xX81,l),  x3(l  1 1,1),  x3(235,l),  x3(450,l)t  x3(452,l)]; 

polysession(  1 6,:)  =  x3(vs,hich_features,100)',%decp 
%*/c%[x3{8  1 , 1 00),  xXH  M 00),  x3(235, 1 00),  x3(450, 1 00),... 
%%%x3(452, 1 00)];  %  polygraph  data  for  two  sessions, 

•/•  i.e  one  truthful  &  one  decpetive 

polysession(2,:)  *=  x3(which_features,48)’,%nondecp 
potysession(3,:)  =  x3(which_features,5)';%nondecp 
polysession(41:)  =  x3(which_features,8)';%nondecp 
potysession(5,:)  «  x3(which_features,9)’,%nondecp 
polysession(6,:)  ”  x3(which_features,  1 2)';%nondecp 
polysessicm(7,:)  -  xXwhich^features,  1 6)*.%nondecp 
polysession(8,:)  *=  x3(w,hich_features,l  8)*,%nondecp 
polysession(9,:)  =  x3(’<,hich_features^  1  y,%nondecp 
polysession(10,:)  *  x3(which_features,24)',%nondecp 
po!ysession(l  1,:)  “  x3(which_fea  tures,2 1)  ,%nondecp 
polysession(12,:)  -  x3(which_feature$,28)'1%nondecp 
polysession(13,:)  *=  x3(which_features32)’;%nondecp 
polysession(14,:)  *  x3(which_features,3  5)*  ;%non  deep 
pdysession(15,:)  «  x3(which_features,44)';%nondecp 

polysession(I7,:)  -  xXwhich_features,95)’>%decp 
polysession(18,:)  *=  x3(which_features,93)';*4decp 
polysession(l  9,:)  =  x3(which_features,89)*;%decp 
polysession(20,:)  *=  x3(which_featiires,88)’;%decp 
polysession(21,:) **  x3(whi  ch_  features  ,85)*  ,%decp 
pdysession(22,:)  =  x3(which_featwes,82)’,%decp 
polysession(23,:)  *=  x3(which_featuxes,75)';%decp 
polysession(24,:)  -  x3(  which_fea  tures  ,72  )*;%decp 
polysession(25,:)  ■=  x3(vvhich_features,67)';%decp 
polysession(26,:)  =  x3(which_features,63X;%decp 
polysession(27,:)  *  xX  vduch_fea  hires,  59)' ,%decp 
potysession(28,:)  *  xXwhich_features,58)‘;%decp 


polysession(29,:)  ■  x3(which_features,53)';54decp 
polysession(30,:)  =  x3(whi  ch_fea  tures  ,51)'  ;54decp 
[howmany,  dim]  *  size(poly session) ,54  "howmany"  must  be  even! 
half-howmany/2; 

ckar  x3;  54save  memory  &  clear 
54+++ 

%initialiation  &  clear 

step  *0.005; 
output- zeros(l,  2) 


output_mean  “[1,2]* 

input_mean  *  polysession; 

input jwidth  =  1  *  ones(howmany,  dim); 

54  Testing(see  100  for  des) 

[dummy,  dummy,  dummy,  output] « ... 
adaptzzy(output_mean,  input_mean,  input  jvidth,  testperson,... 

100,  step); 

V.  Test  how  good  the  output  is  at 
•/•  the  beginning. 


end, 

output 

pause; 

figure(l);clg 
plot(outpuC); 
,/.piot(output_mean/.b'), 
hold  on; 

5imesh(mput_width); 


•/.  SEE  ABOVE  -  SECOND  VERSION. 
•/.User  interface  to  improve! 

54  INITIALIZATION: 

•/.him . . 


step  «  0.5;  54  Learning  factor 

54  The  prefix  "TC"  represents  the  input  data  for  which  the  filter 
54  is  to  be  (Trained  to  (C)lassify: 

TC_howmany  =  size<TC_initrain,  1); 

[howmany,  dim]  —  size(cent_ini  train);  54  representing  data_centers 

clear  output  ; 

output  *  zerosdTCJhowmany,  1  ]); 

•/«  *+r  represents  the  nondeceptive  and  "-1"  the  deceptive  data: 
init_theta_non  *  +  1  *  ones(bordeT,  1); 
init_theta_dcp  -  -1  *  ones((howmany-border),  1); 

output_mean  —  [init_theta_non;  init_theta_dcp];  54  —  data_centers 

input_mean  =  cent  ini  train; 
input_width  -  100  *  ones(howmany,  dim); 

. . . 


54  Before  any  training ... 

54  Test  how  good  the  output  is  at  the  beginning: 

for  k=l:TC_howmany 

if  k<*=TC_border 

des=+l; 

else 

des*- 1; 

end 

[dummy,  dummy,  dummy,  output(k)]  * ... 
adaptzzy(output_mean,  input_mean, ... 
input_width,  TC_initrain(k,:),... 
des,  step); 


end, 

clear  dummy; 
output, 

figure(l);clg 

plot(output,,+'); 
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y^plo^outpuMnean/'b*); 

hold  on; 

pause; 

%mesh(input_width); 

%  Starting  training:  DO  A  BETTER  USERINTERFACE! 

forM:30 

forj“l:5 

for  1  :TC_howmany 

ifk<~TC_border 

des=*+l; 

else 

des=-l; 

end 

[outputmean,  input_mean,  input_width,  output(k)]  *= ... 
adaptzzy(output_mean,  input_mean,  input_width,  ... 
TC_initram(k,:),  des,  step); 

end, 

end, 

output, 

figure(l); 

plot(output,'.');  %axis([  1  1 00  -0.2  2. 1 J); 

*/.plot(output_mean,'*b'); 

*/«mesh(mput_width), 

%pause; 

end; 


% . .  SAVING  THE  FILTER  CHARACTERISTICS: . . 

4«mtfC!!M!!!Il!ltl!!!!!!!!!!!!nin!!!!!!!!!!!!!n!!!!!!I!!!>nO; 

fyrintlTIF  YOU  WANT  TO  SAVE  THE  CHARACRERISTICS  OF  THIS  FILTERS; 
fprintfCPLEASE  TYPE  ANY  NUMBERS  FROM  l-99!\ri); 
fyiintfCTHIS  FILTER  WILL  BE  THEN  SAVED  AS  -filter#"  !\nr); 

clear  numb; 

numb  *  input(The  filter  number^#)  is:1); 

y.  By  default:  numb=(),  i.e.  nothing  win  be  saved. 

if  numb  ~=[], 

numb  =  int2str(numb); 
com  *={*save  'filter',  numb, ... 

1  whichfcatures_3', ... 

'  whichfeature$_2’, ... 

*  whichfeatures_r,  ... 

'  output_mean', '  output_mean', ... 

'  inputmean',  *  input_width']; 

eval(com); 

end; 


%  CREATING  THE  ELLIPTICAL  CLUSTERS  FOR  THE  VISUAL 
y.  INSPECTIONS  -  AND  ALSO  FOR  SITING  THE  RULES. 

function  (x,y}=ellipse(xcenter1ycenter^o<.idth>yv,*idth) 

angle={0:0.02'pi:2-pi]; 

x=xwidth  .*  cos(ang!e)  +  x center, 

y=ywidth  *  sin(angle)  +  y center, 

plotfoy,’-') 


%  TEMPORARY  LMS  SETTING  -  TEST 

function  output_kbeWu2ztemp(input_vect) 
theta  ={  I  1  -1  -if; 

data_centers={  -1  -0.5  ;  0  -0.25  ;  0  0  ;  1  0.3  ]; 
sigmH  0.5  0.8 ;  0.5  0.25  ;  0.1  0.2  ;  0.6  0.5  ); 


*/$  Get  die  dimensions  of  matrices  and  verify  their  consistency: 
flabel^no,  ft_no]  -  cize(data_centers); 

if  (Jlabd_no,  ft_no]  ~~  size(sigjna))  |  Ql,  ft_no]  — »ze(input_vect))  |... 
(pabel_no,  1)  si2e(theta)) 

errorCmatrix  dimensions  are  wrong!  0 

end; 

%-hh- 


•/.  Evaluate  Gaussian  membership  functions: 

distances  *  (ones(label_no,l)  *  input  vect)  -  data_centers; 

a  -  exp(  -0.5  .*  sum(  ( (distances  J  sigma)  *2  )’  )’  ); 


•/*  Centroidal  defuzzification: 
b  -  sum(a); 

output  Jabel «  sum(theta  *  a)  /  b; 
output  Jabel  -  outputlabel  A  2; 

return; 


%  LMS  FILTER  TESTING. 

%  Experimenting  with  the  use  of  adaptive  fuzzy  logic 
y.  in  polygraph  classification. 


init-inputCDo  you  want  to  initialize  all  parameters?  VOi 

ifinit=~y 

%  Initialize  the  parameters  for  fuzzy  LMS  algorithm. 

V.  Output  of  1  means  nondeceptive 
%  Output  of  -1  means  deceptive 
%  length(outputjmean)  «  #  of  rules 
fyrintfi^initializingNn’); 
output_mcan=(  1  1  -1  -If; 

%  input  jmeairJ  centers  of  first  rule ;  centers  of  second  rule ;  etc.  ]  , 
input_mean={  *1  -0.5 ;  0  -0.1 ;  0  0 ;  1  0.3  ]; 

%  input_width-={  widths  of  first  rule  ;  widths  of  second  rule  ;  etc. ); 
input_width={  0.5  1 .3 ;  0.5  0.25 ;  0.1 0.2 ;  0.6  0.5 ]; 

features^ 4 5 1  452];  */•  Select  the  features 

step=0.005;  •/•  Select  learning  rate 


y.  Select  training  data 

ndcp_3=l:  1 5;  %  Kondeceptive  sessions  in  x3  for  training 

dcp_3=5 1:65;  •/•  Deceptive  sessions  in  x3  for  training 

ndcp_2*=n, 

dcp_H); 

ndcp_l=(]; 

dcp_l-[],  %  Note  that  nondeceptive  data  in  xl,  x2,  and  x3 

*/•  are  fire  same,  so  ndcp_2  and  ndcp  l  are  really 
•/.  redundant. 

load  x3; 
load  x2; 
loadxl; 

Ntrain=(x  1  (features /idcp_  1 )  x2(features,ndcp_2)  x3(features,ndcp_3)]’; 
DtrainKxKfeatures.dcp^l)  x2(features,dcp_2)  x3(features,dcp_3)],> 


•/•  Select  testing  data 

ndcp_3={];  */•  Nondeceptive  sessions  in  x3  for  testing 

dcp_3=66:100, 

ndcp_2={]; 

dcp_2={51:100]; 

ndcpj-16:50, 

dcp_T*=[51:100];  %  Note  th at  nondeceptive  data  inxl,x2,  and  x3 

•/•  are  the  same,  so  ndcp_2  and  ndcp_I  are  really 
%  redundant. 


Ntest={x  I  (features ,ndcp_l)  x2(features,ndcp_2)  x3(featuics^dcp_3)]’; 
Dtest=x  1  (features,dcp_l)’; 

Dtest2=x2(features  ,dcp_2X; 

Dtest3=x3(features,dcp_3),; 

clear  xl; 

clear  x2; 

clear  x3; 

clear  record; 

epoch*0, 

end 


%  Test  fuzzy  system  before  any  training 
%  Test  training  data  first 
clear  Nouiput, 
clear  Doutput; 

[N  tr.dumm  y  )=size(N  train) ;  •/•  Ntr « total  U  of  nondeceptive  sessions 

[  Dtr,  dumm  y  }=sizc(Dtrain) ;  •/•  Dtr  -  total  #  of  deceptive  sessions 

ifNtr~“  Dtr 

enor<number  of  nondeceptive  and  deceptive  training  data  mismatch'); 
end 

fori=l:Ntr 

[duriuny, dummy, dumrny,Noutpirt(i)}=adaptzzy(output_rnear4mput_Tnean,.. 

input_width,N  trainO,:),  I  .step); 

[dummy, dummy,  dummy  .Doutput^J^adaptzzyteutpu^meaiynpuMnean,.. 
mput_widthT)train(i,:),- 1  .step); 
end 

%  Record  results 

record(epoch+ 1 ,1  ;2)={(length(find(Nou^put>0)yNtr)  (length(find(Doutput<0)>T>tr)  ]; 
squared_error(epoch+l,l  :2)=[mean((l  -Noutput).A2)  mean((Doutput+  1).A2)]; 
fprintf^percent  correct  nondeceptive  and  deceptive  detections  for  training  data:\rV); 
disp(rttcord(epoch+ 1,1:2)) 


%  Now  test  testing  data 
dear  N  output, 
clear  Doutput; 

[Nte, dummy  }~size(Ntest);  %  Nte  -  total  #  of  nondeceptive  sessions 


fori=l:Nte 


[dummy.dummy, dummy, Noutput(T)l=adaptZ2y(output_meaiUnput_mean,... 
input_width,N  test(l,:),  1  .step) ; 

(Dte,dummyHize(Dt«t);  %  Dte  « total  #  of  deceptive  sessions  in  xl 
fori=l:Dte 

[dummy.dummy ,  dummy  .Doutput©  J=adaptzzy(output_mean^nput_mean, ... 

input_width,Dtest(i,:),-l  .step); 


squared  error(epoch+ 1 ,3 : 4)=( mean((  1  *N  output) . A2)  mean((Doutput+l).A2)], 

[Dte, dummy  }=size(Dtest2);  %  Dte  *  total  #  of  deceptive  sessions  in  x2 
dear  Doutput; 
for  Hl.Dte 

[dummy  .dummy, dummy,Doutput®}~adaptay(output_rnean4nput_rnean,... 

input_width,Dtest2(i,0»- 1  .step); 


squared_erTOf(epoch+ 1 ,5:6)=[mean((  1 -Noutput)  A2)  mean((Doutput+l)  A2)]; 
record(epoch+ 1 ,5:6>=((length(findCNoutput>0)yNte)  Oength(find(Doutput<0)yDte)); 
[Dte,dummyHize(Dtest3);  %  Dte  -  total  #  of  deceptive  sessions  in  x3 
dear  Doutput, 
fori=l:Dte 

[dummy, dumrny.dummy, DoutputtX)}=adapt2zy(output_meanTmput_mean,... 

input_width,Dtest3©0»- 1  ^tep); 


squared_error(epoch+ 1 ,7:8Hmean((l  -Noutput)  A2)  mean((Doutput+ 1)  A2)l; 

rtcord(epoch+  l,7:8)=[Oength(find(Noutput>0)yNte)  (length(6nd(Doutput<0)yDtc)]; 


fyrintfCtrainingpc  1  pc2pc3:\n0; 
di$p(record(epoch+ 1 ,:)) 


•/•  Start  training  and  testing 
fprintffresults  after  trainingXn') 
while  epoch<  1 00000 
epoch=epoch+l 
clear  Noutput; 
dear  Doutput; 

Vo  Training 
fori=l:Ntr 

[output_meanTinput_meaiynput_width1N  output©]=..  . 

adaptzzy(output_m  ean^nput_meanjnput_width, . . . 

NtrainCi,:),l^tep); 

[output_meanjnput_meanjnput_width,Doutput(i)l=s... 

adaptzzyfoutput_mean4nput_mean4nput_width,.  .. 

Dtrain(i,:),-l,step); 

end 

%  end  one  epoch 
%  Test  training  data 
for  i=l:Ntr 

[  dummy, dummy, dummy.N output®  ]=... 

nHapt77y(output_meaninput  meanjnput  width,... 

Ntrain(i,:),l,stepj; 

[dummy,  dummy,  dummy.DoutputO)}3— 

adaptzz><output_mean4nput_mean4nput_wdth,... 

DtrainG,0.*l^tep); 


•/•  Record  results  of  training  data  at  the  end  of  an  epoch 
squared_errotfepoch+ 1,1 :2Hm«m(0-N  output)  A2)  mean((Doutput+ 1)  A2)); 
record(epoch+l,l:2)=[0cngth(find(Noutput>0)yNtr)  Oength(fincl(Doutput<0))'Dtr) ), 


•/•  Now  test  testing  data 
clear  Noutput; 
clear  Doutput; 

[N  te, dummy  }=size(N  test) ; 
fori=l:Nte 

[dummy  .dummy  ,dummy,N  output®  J^daptzzyfoutpu^meanmputjTiean,... 

input_widtk,Ntest(i1:),  1  .step); 


end 

[  Dte,  dummy  }*=size(Dtes  t) , 
for  i=l:Dte 

[dummy, dummy,dumrny,Doutput©}=adaptzzy(output_meaiynput_mean,..  • 

input_width,DtestCu:),- Utep); 


squared_error(epoch+ 1  -Noutput).' A2)  mean((Doutput+  1).A2)]; 

record(epoch+U:4HG«^(find(Noutput>0)yNte)  Oength(find(Doutput<0)yrXe)  ]; 
[Dte,dummy)”size(Dtest2),  %  Dte  -  total  #  of  deceptive  sessions  in  x2 
dear  Doutput; 
for  f=1  .Dte 

[dummy, dummy,dummyX>output(i)}=adaptzzy(output_mean^nput_mean,... 

input_width,Dtest2(i,:),-l  .step); 


squared_eiTor(epoch+ 1 ,5:6Hmean((l  -Noutput)  A2)  mean((Doutput+  1).A2)]; 
record(epoch+ 1  ^:6HO«n«th(^dCNoutput>0)yNte)  Oength(find(Doutput<0)yDte)]; 
[Dte,dummyl=size(DtesG);  V.  Dte  *  total  *  of  deceptive  sessions  in  x3 
clear  Doutput; 
for  f=T.  Die 

[dummy, dummy, dummy  ,Doutput©}=adaptzzy(output_ineaiynput_inean,... 

input_width,Dtest3®0.’l  .step); 


squared  enor(epoch+l  ,7:8)=[nrean(0 -Noutput)  A2)  mean((Doutput+  1).A2)]; 
rccord(epoch+l  ,7:8HOength(find(Noutput>0)yNte)  Oength(find(Doutput<0)yDte)]; 

fprintfl>aining,x  I  tx2rx3:Vi'); 
disp(record(epoch+ 1 ,:)) 

end  %  Go  to  next  epoch 


%  Experimenting  with  the  use  of  adaptive  fuzzy  logic 
%  in  polygraph  classification. 

fortriah=l:l 

%  Initialize  the  parameters  for  fuzzy  LMS  algorithm. 

%  Output  of  1  means  nondeceptive 
%  Output  of -1  means  deceptive 
%  length(outputjmean)  *  M  of  rules 
fyrintfl>iitiafizing\n'); 
output_mean={  11-1  -If; 

%  input_mean={  centers  of  first  rule ;  centers  of  second  rule  ;  etc. }; 
input_mean={  -1  -0.5 ;  0  -0.25  ;  0  0 ;  1  0.3 }; 

%  input_width=c{  widths  of  first  rule  ;  widths  of  second  rule  ;  etc.  ]; 
inpat_width={  0.5  0.8 ;  0.5  0.25 ;  0.1  0.2 ;  0.6  0.5  ]; 

features={451  452};  v  %  Select  the  features 
step=0.005;  %  Select  learning  rate 

tramen=10,  %  Select  ft  of  training  samples  from  each  category 


%  Select  training  data 

tempjv=randperm(50); 

temp_d=5CH-randperm(50); 

ndcp_3=[l:5  7;  10  12  13  15  16  18:20  22  23  25  26  28  29  31  32  34  35  3738  40  41  43  44  46:49]; 
dcp_H51  54  57  60  64  67  70  73  76  79  82  85];%  Deceptive  sessions  in  x3  for  training 
ndcp  2={]; 

dcp_H51  53  56  59  62  65  68  71  74  78  81  84]; 
ndcp_H]; 

dcp_l={51  54  57  59  62  65  68  71  74  77  80  83]; 

%  Note  that  nondeceptive  data  in  xl ,  x2,  and  x3 
%  are  the  same,  so  ndcp_2  and  ndcp_l  are  really 
%  redundant. 

k>adx3; 

loadx2; 

loadxl; 

Ntram={x  1  (features  ,ndcp_l )  x2(features,ndcp_2)  x3(features,ndcp_3)]'; 

Dtrain=(xl  (features,dcp_l)  x2(features,dcp_2)  xXfcatures,dcp_3))'; 

%  Select  testing  data 

ndcp_3=[6  11  14  17  21  24  2  7  30  33  36  39  42  45  50), 

dcp_3={52  53  55  56  58  59  61:63  65  66  68  69  71  72  74  75  77  78  80  81  83  84  86:100); 
ndcp_2={]; 

dcp_2=(52  54  55  57  58  60  61  63  64  66  67  69  70  72  73  75:77  79  80  82  83  85:100], 
ndcp_l=(]; 

dcp_H52  53  55  56  58  60  61  63  64  66  67  69  70  72  73  75  76  78  79  81  82  84:100], 

%  Note  that  nondeceptive  data  in  xl ,  x2,  and  x3 
%  are  the  same,  so  ndcp_2  and  ndcp_l  are  really 
%  redundant. 

Ntest=[xl  (features ,ndcp_l)  x2(featuresrndcp_2)  x3(features,ndcp_3)]'; 

DtesHxl  (features, dcpj)  x2(features,dcp_2)  x3(features,dcp_3)]’, 

clear  xl; 

clear  x2; 

clear  x3; 

clear  record; 

clear  temp_n; 

clear  temp  _d  , 

epoch=0; 


%  Test  fuzzy  system  before  any  training 
%  Test  training  data  first 
clear  Noutput; 
clear  Doutput; 

[Ntr;<hinuny)*=size(N train);  %  Ntr  « total  #  of  nondeceptive  sessions 
(ptr^TTT^my^iyf^TXrairiy,  %  Dtr  -  total  ft  of  deceptive  sessions 
ifNtr-^Dtr 

err orfN umber  of  nondeceptive  and  deceptive  training  data  mismatch'); 
end 

forH:Ntr 

[  dummy ,  dummy  ,dumm  y.Noutput(T)}*a  daptzzy(outpu  t_m  eanjnput^m  ean, . . 
input_width,Ntrain(i, :),  1  .step) ; 

(dummy, durrrniy,dimuny,Doutput(i)}^daptezy(output_meaminput_meajv. 
input_width,Dtrain(i, :),-  l^tep); 
end 

%%  fprintfCResulls  of  training  data  before  training^’), 

%%  Noutput 
%%  Doutput 
%  Record  results 

record(epoch-H,l  :2)^(length(find(Noutput>0)yNtr)  0«ngth(find(Doutput<0)yDtr)  ]; 
fprintfC  percent  correct  nondeceptive  and  deceptive  detections  for  training  data:\nO; 


disp(record(epoch+ 1 , 1 .2)) 


%  Now  test  testing  data 
clear  N  output; 
clear  Doutput; 

[Nte,dummy)=size(Ntest);  %  Nte  -  total  H  of  non  deceptive  sessions 
for  F=l:Nte 

[dummy.dummy, dummy  ,N  output(i)]=adaptz2y(output_meanjnput_mean1... 
input_width,Nte$t(i,:),  1  .step); 
end 

[Dte, dummy  }*=size(Dtest);  %  Dte  “=  total  M  of  deceptive  sessions 

for  i~l:Dte 

[dummy,dummy,dummy,Doutput(r)}=«daptzzy(output_mearvinput_mean,... 

inpuMvidth,Dtest(i,:),*l»stcP)i 

end 

if  (Nte  —  0)<fc(Dte  — 0) 

%%  fyrint^Results  of  testing  data  before  trainingW); 

%%  Noutput 
*/••/«  Doutput 
•/•  Record  results 

record(epoch+ 1  f3^)=={Oen^b(find(Noutput>0)yNte)  (length(find(Doutput<0)yDte) ); 
fprintfCpercent  coned  non  deceptive  and  deceptive  detections  for  testing  dataAn'), 
disp(record(epoch+ 1  3:4)) 
end 


•/#  Start  training  and  testing 
fpiintfCresuhs  after  trainingNn') 
while  epoch<50 
epoch”epoch+l 
clear  Noutput, 
clear  Doutput, 

Training 
for  i=l  :Ntr 

(output_mearv4nput_mean4nput_width,Noutput(i)]=... 

adaptz2y(output_ineaivnput_mean4nput_width,... 

Ntrain(i,:),l,step); 

[output_mean4nput_mean4nput_widthJ)output(f)}=.. . 

adaptzzy(output_mearLrinput_mearynput_width,. . . 

Dtrain(i,:),*l,step); 

end 

%  end  one  epoch 
*/•  Test  training  data 
for  i=l:Ntr 

(dummy,  dummy,dumrny,Noutput(i)}=... 

adaptzzy(output_meanjnput_n>ean^nput_width,... 

NtrainCi,:),l^tep); 

[dummy  .dummy, dummy,Doutput(?))=... 

adapta^output_meanjnput_meanjnput_wid  th, . . . 

DtrainOi:)»*l»steP)'» 

end 

%%  fprintfCresuIts  of  training  dataW) 

•/<,*/«  Noutput 
%•/*  Doutput 

%  Record  results  of  training  data  at  the  end  of  an  epoch 

record(epoch+l,l  :2)=[0cngth(find(Noutput>O)yNtr)  Ocngth(find(Doutput<0)yDtr)  ]; 
fprintfCpercent  correct  nondeceptive  and  deceptive  detections  for  training  dataAn') 
disp(record(epoch+ 1 , 1 :2)) 

if  (Nte  —  0)&(Dte  —  0) 

•/•  Now  test  testing  data 
clear  Noutput, 
clear  Doutput; 
fori*=l:Nte 

[dummy, dummy, dummy,Noutput(i)}:aadaptzzy(oiitput_mearynput_mean ... 
input_width,NtestCi,:),l  ^tep); 
end 

for  i=I:  Dte 

[dummy, dummy  ,dummy,Doutput©}**daptzzy(output_rnean4nput_rnearv.. 
input_width,Dtestfi,:),- 1  .step); 
end 

%%  fyrintftVesuhs  of  testing  data\n*) 

%%  Noutput 
%%  Doutput 

record(epoch  + 1  ^3:4)={  (length(find(N  output  >0)yN  te)  (length(find(DoutpuKO)yEXe) ); 
fprintfl^  percent  correct  nondeceptive  and  deceptive  detections  for  testing  data:\nr) 
chsp(record(epoch+ 1 3:4)) 
end 

end  %  Go  to  next  epoch 

maximum(triaI>=max(record(:3)+Tecord(:,4)); 

temp={find((record(:3)+rct^d(:.4))==TnaximuTn(triaI))'  0  0  0  0  0); 

maxiina(trial,l:5)=temp(l:5); 

maxima(triai1 1 :5) 

maximum^ 

end  %  Go  to  next  trial 

m  aximumi=m  axim  um/2 


EPILOGUE  -  Motivation,  challenges  and  risks 


I  was  easily  fascinated  by  the  idea  of  a  lie-detector  at  the  very  first  moment  I  heard  about  it.  I 
thought,  'we  are  not  supposed  to  lie  anyway  and  a  lie-detector  can  help  us  find  and  prevent  a 
major  part  of  the  crimes  committed  in  our  society.  I  became  even  more  motivated  to  do  this 
research  by  an  innovative  way  of  pattern  recognition,  namely  the  fuzzy  approach. 

But  very  soon,  I  also  began  to  realize  its  danger  -  while  juggling  with  numerical  data  and  being 
far  from  the  reality  of  testing  actual  human  beings  and  judging  them  by  an. algorithm. 

An  example:  Too  ’good'  detection  rates! 

In  my  project,  I  obtained  in  certain  cases  up  to  97%  correct  detection  rate.  That  is,  indeed,  an 
impressive  number.  However,  the  emphasis  lies  on  "certain  cases"  -  not  only  in  this  thesis. 

A  non-technically  oriented  user  of  such  a  product  is  tempted  to  put  too  much  trust  into  these 
kinds  of  high  rates.  Even  if  we  have  a  stable  lie-detector  with  99%(!)  correct  detection,  this  still 
means  that  one  out  of  100  persons  will  be  judged  incorrectly. 

In  our  daily  life,  we  do  not  have  the  natural  skill  to  "see"  who  is  deceptive,  but  some  biological 
and  psychological  features  that  enable  us  to  estimate  whether  and  to  what  degree  someone  is 
lying.  This  is  exactly  what  1  have  exploited  in  this  project.  In  fact,  even  the  fuzzy  approach  is 
similar  to  the  human  way  of  categorizing  someone's  deceptiveness  in  soft  terms  like  "She  lies 
seldom"  or  "He  is  often  deceptive",  instead  of  hard  labeling  like  "She  is  truthful"  or  "He  is 
deceptive". 

After  all,  I  am  convinced  that  no  lie-detector  -  even  if  it  could  work  easily  with  different 
polygraph  formats,  and  is  perfect  in  technical  terms  -  can  ever  be  constructed  with  such  a  high 
detection  rate63  that  one  coidd  judge  a  person  without  any  witnesses  or  other  additional 
inquiries.  We  may  only  use  a  lie-detector  as  a  helpful  "objective"  tool,  but  never  as  an  ultimate 
decision  maker. 

My  initial  goal  was  to  be  aware  of  this  responsibilty  and  not  to  lose  the  global  perspective  while 
dealing  with  technical  details.  I  hope  1  have  accomplished  this. 

I  also  hope  for  an  environment  where  we  do  not  judge  people  who  hurt  us,  but  do  forgive  them. 
In  that  case,  we  ourselves  are  forgiven  too,  since  all  of  us  deserve  to  be  judged,  don't  we! 


Ramin  Djamschidi 
San  Jose,  September  1994 . 


^See  e.g.  chapter  4.3.  for  "Outlier  effect"  and  "Performance  limitations". 
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NON-DECEPTIVE  DATA 


_ KEY _ 

*  standard;  CODE.Olt,  012, 013, 021  022, 023, 031, 032, 033 
t#  Index:  error  message  in  MATLAB  reads, 

>>process 

“Index  exceeds  matrix  dimensions. 

»Enor  m=*>c:\users\ulka\non\extractf.m 

on  line  48— *>  start  -  begin(i)  +  30  *times(first_channeU); 

»Error  in=>c:\usersVuikaWn\process.ni 
on  line  6—^feature  -  extractfTz,  feature  list);" 

AreadJ:  CODE.Olc,  .02c,  .03c,  .023,  .033,  .011,  .021,  .031,  .013 
confusing  as  to  how  to  READ3  these  files 

***N/A:  discs  were  unable  to  be  processed 

AAextra:  CODE.Q41,  .042.  .043  processed  as  t4 _ 
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1 


NON-DECEPTIVE  DATA 


ERS 

SUB# 

CODE 

#  OF  FILES 

EXTRA  FILES 

ERRORS 

1 

2 

$$EACOWO 

standard* 

none 

none  • 

1 

4 

$$EAD5LX 

standard 

none 

none 

1 

6 

$$EANWKF 

13 

0.005 

none 

1 

8 

$$EAOZD6 

standard 

none 

none 

1 

9 

$$EAQWB9 

standard 

none 

none 

1 

11 

$$EARKZ6 

standard 

none 

none 

1 

12 

$$EARJS0 

standard 

none 

none 

1 

13 

$$EA%KR9 

standard 

none 

Index**  t3 

1 

15 

$$EA%H#L 

standard 

none 

none 

1 

18 

$$EB2IYL 

standard 

none 

none 

1 

22 

$$EC4QN3 

standard 

none 

none 

1 

26 

$$EC7N7X 

standard 

none 

none 

1 

33 

$$ECLMTU 

standard 

none 

none 

1 

34 

$$ECMA%C 

standard 

none 

none 

1 

35 

$$ECM7GX 

standard 

none 

none 

1 

36 

$$ECMWB3 

standard 

none 

none 

1 

40 

$$EC#G20 

standard 

none 

none 

1 

43 

$$EC$OOF 

standard 

none 

none 

1 

44 

$$ED805U 

standard 

none  : 

none 

1 

45 

$$ED8LUI 

standard 

none 

none 

1 

46 

$$ED9439 

9 

read3A 

N/A*** 

1 

47 

$$ED9TCX 

standard 

none 

none 

1 

50 

$$EDBQR2 

standard 

none 

none 

1 

53 

$$EDCZYZ 

12 

extraAA 

none 

1 

59 

$$EDPY4# 

standard 

none 

none 

1 

60 

$$EDQCY9 

standard 

none 

none 

1 

61 

$$EDQ28X 

standard 

none 

none 

1 

62 

$$EDQOCF 

standard 

none 

index  tl 

1 

65 

$$EDRKGO 

standard 

none 

none 

1 

66 

$$EDRMU# 

standard 

none 

none 

2 

11a 

$$FZIMEU 

13 

.005,  extra 

index  tla 

2 

11b 

$$FZISQ# 

standard 

none 

none 

2 

12 

$$FZIT4L 

standard 

none 

none 

2 

14 

$$FZJ52# 

standard 

none 

index  tl 

2 

30 

$$FZZN1Y 

10 

0.005 

index  t3 

2 

32 

$$FZ#D6J 

10 

0.005 

none 

2 

33 

$$FZ#0HX 

13 

.005,  extra 

div  by  zero  t3 

2 

35 

$$FZ$3A& 

standard 

none 

none 

2 

36 

$$F#8CY9 

11 

.005, .STR 

none 

2 

38 

$$F#9FJL 

10 

0.005 

index  t2,  t3 

2 

41 

$$F#B6SC 

standard 

none 

none 

2 

42 

$$F#B6C# 

standard 

none 

none 

2 

45 

$$F#NMDX 

standard 

none 

index  tl 

2 

47 

$$F#NHQT 

standard 

none 

none 

2 

48 

$$F#&7GC 

standard 

none 

index  t3 

2 

51 

$$F#QJTF 

standard 

none 

none 

2 

52 

$$F#S0KR 

standard 

none 

none 
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NEWS.XLS 


ERS 

SUB# 

CODE 

#  OF  FILES 

EXTRA  FILES 

ERRORS 

47 

2 

53 

$$F#RRD5 

standard 

none 

none 

48 

2 

54 

$$F#RYFR 

12 

extra 

index  t3 

49 

2 

55 

$$F#SALQ 

10 

0.005 

index  t3 

50 

2 

56 

$$F$C#2# 

standard 

none 

none 

51 

3 

2 

standard 

none 

none 

52 

3 

12 

$$F$I41X 

11 

.005, .STR 

none 

53 

3 

25a 

$$F$IUY0 

10 

0.005 

none 

3 

25b 

$$F$UI3X 

11 

.005,  .STR 

none 

54 

3 

31 

$$F$WNSF 

standard 

none 

none 

55 

3 

43 

$$F%51&G 

10 

.STR 

index  tl 

56 

3 

46 

$$F%5$UF 

standard 

none 

none 

57 

3 

49 

$$F%7K#0 

standard 

none 

none 

58 

3 

59 

$$F%JAK6 

standard 

none 

none 
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DECEPTIVE  DATA 


KEY 


OlOil-ttlCO  N3 


DECEPTIVE  DATA 


SUB# 


la 


1b 


1c 


5 


CODE 


$$G3#SGD 


$$EACLB6 


$$G3$6HN 


$$EAN#XO 


$$EAOQXV 


$$EAQ%%U 


$$EB0289 


$$EA%%MX 


|$$EB2WE$ 


$$EC4%GO 


$$EC77GI 


$$EC760R 


$$ECIX9# 


$$ECIVB0 


$$ECJHKO 


$$ECJVSI 


$$ECJ#Z$ 


$$ECLODC 


$$ECXAPG 


1$$ECYCG0 


$$EC#$FA 


$$EC$ANC 


$$ED9$N# 


$$EDB$S3 


$$EDCSRC 


$$EDDBUX 


$$EDCBSU 


$$EDDHTI 


|$$EDP26U 


$$EDQYMF 


$$EDR3XI 


$$EDS3ZL 


$$FZ3Z5S 


$$FZ3XG6 


$$FZ52G6 


$$FZ6&46 


$$FZ7B#C 


$$FZ7GP# 


$$FZIMEU 


$$FZJ358 


$$FZL9ZR 


$$FZLBY& 


$$FZMQ#C 


$$FZMW$H 


$$FZWQQC 


$$FZW5T# 


|$$FZYCM& 


#  OF  FILES 


standard* 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


11 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


12 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


17 


10 


10 


standard 


10 


10 


standard 


standard 


13 


none 


none 


.005,  .STR 


none 


index  t3 


format 


none 


none 


none 


none 


none 


index  t1,t2 


index  t3 


none 


none 


extra,  .005 
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5-6 


1 


